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Abstract. We consider two- player games played in real time on game structures with 
clocks where the objectives of players are described using parity conditions. The games are 
concurrent in that at each turn, both players independently propose a time delay and an 
action, and the action with the shorter delay is chosen. To prevent a player from winning by 
blocking time, we restrict each player to play strategies that ensure that the player cannot 
be responsible for causing a zeno run. First, we present an efficient reduction of these 
games to turn-based (i.e., not concurrent) finite-state (i.e., untimed) parity games. Our 
reduction improves the best known complexity for solving timed parity games. Moreover, 
the rich class of algorithms for classical parity games can now be applied to timed parity 
games. The states of the resulting game are based on clock regions of the original game, 
and the state space of the finite game is linear in the size of the region graph. 

Second, we consider two restricted classes of strategies for the player that represents 
the controller in a real-time synthesis problem, namely, limit-robust and bounded-robust 
winning strategies. Using a limit-robust winning strategy, the controller cannot choose an 
exact real- valued time delay but must allow for some nonzero jitter in each of its actions. 
If there is a given lower bound on the jitter, then the strategy is bounded-robust winning. 
We show that exact strategies are more powerful than limit-robust strategies, which are 
more powerful than bounded-robust winning strategies for any bound. For both kinds 
of robust strategies, we present efficient reductions to standard timed automaton games. 
These reductions provide algorithms for the synthesis of robust real-time controllers. 



1. Introduction 

Timed automata |AD94j are models of real-time systems in which states consist of discrete 
locations and values for real-time clocks. The transitions between locations are dependent 
on the clock value s. Timed automato n games, introduced in |MPS95j . and explored fur- 
ther in |dAFH+03l lAdAF051 ICDF+051 lFTM02bl IFTM02a| (amongst others), are played by 



two players on timed automata, e.g., a "controller" and a "plant" for modeling real-time 
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Figure 1: A timed automaton game 7. 



controller synthesis problems. We consider timed automaton games with w-regular objec- 
tives specified as parity conditions. The class of u;-regular objectives can express all safety 
and liveness specifications that arise in the synthesis and verification of reactive systems, 
and parity conditions are a canonical form to express w-regular objectives [Tho97| . The 
construction of a winning strategy for player 1 in such games corresponds to the controller- 
synthesis problem for real-time systems [DM02, MPS95, WH91] with respect to achieving 
a desired w-regular objective. 

Timed automaton games proceed in an infinite sequence of rounds. In each round, both 
players simultaneously propose moves, with each move consisting of an action and a time 
delay after which the player wants the proposed action to take place. Of the two proposed 
moves, the move with the shorter time delay "wins" the round and determines the next 
state of the game. Let a set $ of runs be the desired objective for player 1. Then player 1 
has a winning strategy for <3? if it has a strategy to ensure that, no matter what player 2 
does, one of the following two conditions holds: (1) time diverges and the resulting run 
belongs to <3?, or (2) time does not diverge but player-l's moves are chosen only finitely 
often (and thus it is not to be blamed for the convergence of time) [dAFH+03llHP06] . This 
definition of winning is equivalent to restricting both players to play according to receptive 
strategies [AH97l lSGSAL98j . which do not allow a player to win by blocking time. 

In timed automaton games, there are cases where a player can win by proposing a 
certain strategy of moves, but where moves that deviate in the timing by an arbitrarily 
small amount from the winning strategy result in a strategy that does not ensure winning 
any more. If this is the case, then the synthesized controller needs to work with infinite 
precision in order to achieve the control objective. As this requirement is unrealistic, we 
propose two notions of robust winning strategies. In the first robust model, each move of 
player 1 (the "controller") must allow some jitter when the action of the move is taken. 
The jitter may be arbitrarily small, but it must be greater than 0. We call such strategies 
limit-robust. In the second robust model, we give a lower bound on the jitter, i.e., every 
move of player 1 must allow for a fixed jitter, which is specified as a parameter of the game. 
We call these strategies bounded-robust. The strategies of player 2 (the "plant") are left 
unrestricted (apart from being receptive). We show that (1) general strategies are strictly 
more powerful than limit-robust strategies; and (2) limit-robust strategies are strictly more 
powerful than bounded-robust strategies for any lower bound on the jitter, i.e., there are 
games in which player 1 can win with a limit-robust strategy, but there does not exist any 
nonzero bound on the jitter for which player 1 can win with a bounded-robust strategy. 
The following example illustrates this issue. 
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Example 1.1. Consider the timed automaton 7 in Fig. [TJ The edges denoted a\ for 
k G {1,2,3,4} are controlled by player 1, and the edges denoted a\ for j € {1,2,3} are 
controlled by player 2. The objective of player 1 is D(-iZ 3 ), i.e., to avoid the location Z 3 . 
The important part of the automaton is the cycle I , I . The only way to avoid Z 3 in a time 
divergent run is to cycle between Z° and Z 1 infinitely often. In addition, player 1 may choose 
to also cycle between Z° and I 2 , but that does not help (or harm) it. Due to strategies being 
required to be receptive, player 1 cannot just cycle between Z° and Z 2 forever, it must also 
cycle between Z° and I 1 ; that is, to satisfy □( _, Z 3 ) player 1 must ensure (DOZ ) A (DOZ 1 ), 
where DO denotes "infinitely often". But note that player 1 may cycle between Z° and Z 2 
any finite number of times as it wants between an Z°, Z 1 cycle. 

In our analysis below, we omit such Z°,Z 2 cycles for simplicity. Let the game start from 
the location Z° at time 0, and let Z 1 be visited at time t° for the first time. Also, let a? 
denote the difference between times when Z° is visited for the (j + l)-th time, and when Z 1 is 
visited for the j-th time. We can have at most 1 time unit between two successive visits to 
Z°, and we must have strictly more than 1 time unit elapse between two successive visits to 
I . Thus, a 3 must be in a strictly decreasing sequence. Also, for player 1 to cycle between 
Z° and Z 1 infinitely often, we must have a 3 > for all j as the (j + l)-th a\ transition must 
always happen after the j-th a 2 transition. Consider any bounded-robust strategy. Since 
the jitter is some fixed ej, for any strategy of player 1 that tries to cycle between Z° and Z 1 , 
there are executions where the transition labeled a\ is taken when x is less than or equal 
to 1 — ej, and the transition labeled a 2 is taken when y is greater than 1. This means that 
there are executions where a 3 decreases by at least ej in each cycle. But, this implies that 
we cannot have an infinite decreasing sequence of a 3 's for any e- } and for any starting value 
oft . 

With a limit-robust strategy, however, player 1 can cycle between the two locations 
infinitely often, provided that the starting value of x is strictly less than 1. This is because 
at each step of the game, player 1 can choose moves that are such that the clocks x and y 
are closer and closer to 1. A general strategy allows player 1 to win even when the starting 
value of x is 1. The details will be presented later in Example 14.91 in Subsection 14.21 O 

Contributions. Our contributions are two-fold: we present improved complexity results 
to solve timed automaton parity games, and we present two notions of robust winning in 
timed automaton parity games and present solutions of them. 

Improved complexity. We first show that timed automaton parity games can be reduced 
to classical turn-based finite-state (untimed) parity games. Even though the timed games 
are concurrent, in that in each turn both players simultaneously propose moves before one 
of the moves is chosen, our reduction to the untimed finite-state game is turn-based. The 
concurrency in timed games is limited (only in proposal of time), and we exploit this in 
the reduction to obtain turn-based games. In general the reduction of concurrent games to 
turn-based games is only known for Biichi and coBiichi objectives (and only for qualitative 
analysis) [JKH02]. The turn-based game we obtain as result of the reduction has a state 
space that is linear in the number of clock regions. There is a rich literature of algorithms as 
well much ongoing research to solve finite-state turn-based parity games, and our reduction 
allows all these algorithms to be used to solve timed automaton parity games. A solution 



for timed automaton games with parity objectives was already presented in dAFH + 03] and 
the solution works in 

O ((M ■ \C\ ■ \A,\ ■ \A 2 \) 2 ■ (\S* Reg \) d+2 ^ 
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time, where M is the maximum constant in the timed automaton; C is the set of clocks; Ai 
is the set of player-z edges; L is the set of locations; d is the number of priorities in the parity 
index function; and S Reg is the set of states in the region graph of the timed automaton 
expanded to handle receptiveness and we have \S Reg \ = \SR eg \ ■ 32 • |C| ■ d, where SR eg is the 
set of regions of the timed automaton (which is bounded by \L\ ■ Y\ x£ c( c x + 1) • \C\\ • S c \, 
with c x being the maximum constant that clock x is compared to in the timed automaton 
game). We show that timed automaton games can be solved in 

0((|5Se g |-(|A 1 | + |il 2 |)).(| 1 SSeg|.8)^ + ^ 

time. Our reduction has two steps: first we show that certain restrictions can be applied to 
strategies without changing the winning set; then we show that the timed game with clocks 
under the strategy restrictions can be transformed into a finite-state turn-based game with 
8- \S Reg \ states, 0(|<SR eg | • (\A±\ + | A2 1 ) ) edges, and d+2 priorities. Our improved complexity 
follows from the above reduction, and the fact that a finite-state turn-based parity game 
with m edges, n states and d parity indices can be solved in 0(m • ns + 2) time [Sch07] . 
The restriction to receptive strategies is handled by our reduction with the following two 
modifications: (1) the number of regions of the timed automaton parity game needs to be 
enlarged by a factor of 32 • |C| • d, and (2) the number of indices of the parity function needs 



to be increased by 2. The modifications are similar to those in dAFH + 03 . 
Robust winning. Second, we show that timed automaton games with limit-robust and 
bounded-robust strategies can be solved by reductions to general timed automaton games 
(with exact strategies). In the reduction for limit-robust games, the jitter is controlled by 
player 1, as the jitter is only required to be greater than 0, with no other restriction. For 
bounded-robust games, the jitter is controlled by player 2, as there is an uncertainty interval 
of constant length ej (which is fixed for the game). The reduction for the limit-robust case 
is obtained by changing the winning condition so that moves are only to states where all the 
clock values are non-integral. The reduction for the bounded-robust case is by a syntactic 
transformation of the game graph. The limit-robust game can be solved in time 

O f (15^1 -(lAil + l^l)). (15^1 -16)- 

Given a rational valued jitter of e = f^, the bounded-robust game can be solved in time 



d+2 j 1 
3 '2 



d+2 1 1 



O (\S* Re J ■ \A,\ 2 ■ \A 2 \ ■ \C\ ■ e n ■ e i P +2 ) ■ (\S Reg \ • 32 ■ \C\ ■ \A,\ • e n ■ ejf 1+2 ) 3 ~ 2 



'Reg I ' 1^11 • l^2| • |^l ' fc n • t d J ' y\ J Reg 

The reductions provide algorithms for synthesizing robust controllers for real-time systems, 
where the controller is guaranteed to achieve the control objective even if its time delays 
are subject to jitter. The question of the existence of a non-zero jitter for which a game 
can be won with a bounded-robust strategy remains open. 

Comparison to the preliminary version [CHP08J. Our present submission extends and 
improves upon the results of [CHP08J. The state space of the finite game of the reduction 
in |CHP08] had size 0(|5p eg | -\C\-M ■ \Ax\); the present finite-state game has size 0(\S R \). 
The correctness proof of the present reduction is based on several new non-trivial results 
that are not present in [CHP08J. 
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Related work. Timed automaton games have been explored before for controller synthesis, 
e.g, in [PAMS981 IDM021 IMPS95j IWH91] . In most previous work, time-divergence has not 
been handled properly. For example, the formulations of [PAMS981 IFTM02bl [JT071 ITri09j 
assume that the syntactic structure of the game is such that it is not possible for the players 
to block time. The work of [AH97] looks at safety objectives, and correctly requires that 
a player might not stay safe simply by blocking time. It however also requires that the 
controller achieve its objective even if the opponent blocks time, and hence is deficient for 
reachability objectives. In |DM02j the authors require player 1 to always allow player 2 
moves, thus, in particular, player 2 can foil a reachability objective of player 1 by blocking 
time. The strategies of player 1 are also assumed to be region strategies by definition, that 
is, player-1 strategies can only specify regions in its moves (and not an exact desired state), 
and can only be dependent on the history of the observed regions in a play (and not on the 
history of exact states in a play). A solution for timed automaton games with receptive 



strategies and parity objectives was first presented in dAFH + 03 , where the solution is 
obtained by first demonstrating that the winning set can be characterized by a //-calculus 
fixpoint expression, and then showing that only unions of clock regions arise in its fixpoint 
iteration. 

It has been recognized by many researchers that an important shortcoming of timed 
automata is that clock values are assumed to be available as real numbers with infinite pre- 
cision. Our notion of bounded-robustness is closely related to the Almost-ASAP semantics 
of [WDR05] . The work there is done in a one-player setting where the controller is already 
known, and one wants the know if the composition of the controller and the system satisfies 
a safety property in the presence of bounded jitter and observation delay. A similar model 
for hybrid automata is considered in [2T04]. The solution for the existence of bounded 
jitter and observation delay for which a timed system stays safe is presented in [WDMR04 . 
Various models of robustness in timed automata (the one-player case) are also considered 
in [ATM051 IBMRMI IGH.T971 IHROO] . 

Outline. In Section [2] we present the definitions of timed game structures, objectives, 
strategies, and review some basic results for timed automaton games. In Section [3] we 
present a restriction of strategies of the two players, which does not change the winning set, 
and which allows an efficient reduction to finite-state turn-based games. We also derive the 
complexity bound obtained from the new reduction to finite-state games. In Section H] we 
define limit-robust and bounded-robust strategies, and show how winning sets for both can 
be computed by reductions to timed automaton games. 



2. Timed Games 

In this section we present the definitions of timed game structures, runs, objectives, and 
strategies in timed game structures. 

Timed game structures. A timed game structure is a tuple 9 = (S, A\, A2, Ti, T2, S) 
with the following components. 

• S is a set of states. 

• A\ and A2 are two disjoint sets of actions for players 1 and 2, respectively. We assume 
that {J-i,±2, -U} n Ai = 0, and write A± for A x U{JLi, 1*}, and A^ for A 2 U{_L 2 }- The 
set of moves for player i is M{ = IR>o X Af. Intuitively, a move (A, a*) by player i 
indicates a waiting period of A time units followed by a discrete transition labeled with 
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action a«. The move (A, _Lj) is used to represent the move where player-i just lets time 
elapse for A time units without taking any of the discrete actions from Ai. The action 
_L* is used to represent the fact that player 1 is relinquishing control to player 2 in the 
given stage of the game. 

• Tj : S \— > 2 Mi \ are two move assignments. At every state s, the set Tj(s) contains the 
moves that are available to player i. We require that (0, _Lj) G Tj(s) for all states s £ S 
and i G {1, 2}. Intuitively, (0, _U) is a time-blocking stutter move. 

• 5 : S x (Mi U M 2 ) 1 — y S is the transition function. We require that for all time delays 
A, A' G !R>o with A' < A, and all actions G A^~, we have 

(i) (A, a~) G Ti{s) iff both (A', !_;) G Y^s) and (A - A', a;) G Ti(5(s, (A', ± i ))) ] and 

(ii) if S(s, (A', Li)) = s' and 5(s' , (A - A', Oj)) = s", then S(s, (A, Oj)) = s" . 

The game proceeds as follows. If the current state of the game is s, then both players 
simultaneously propose moves (Ai,ai) G Ti(s) and (A2,a2) G ^(s). If a\ 7^ _L*, the move 
with the shorter duration "wins" in determining the next state of the game. If both moves 
have the same duration, then the next state is chosen non-deterministically. If a\ = _L*, then 
the move of player 2 determines the next state, regardless of Ai. We give this special power 
to player 1 for modeling convenience as (1) the controller always has the option of letting the 
state evolve in a controller-plant framework, without always having to provide inputs to the 
plant, and (2) it allows a natural model for systems where controller actions are disabled in 
certain mo Formally, we define the joint destination function <5jd : S x M\ x M2 h-> 2 
by 

{{6(s, (Ai,a x ))} if Ai < A 2 and ai / _L*; 

{6(8, (A 2 ,a 2 ))} if A 2 < Ai or 01 = _U; 

{5{s,{A 2 ,a 2 )),5(s,(A 1 ,a 1 ))} if A 2 = Ai and 01 + U- 

The time elapsed when the moves mi = (Ai,ai) and m 2 = (A2,a 2 ) are proposed is given 
by 



, . / min(Ai,A 2 ) if a x ^ _L* 

delay(mi,m 2 ) = | ^ if fll = 



The boolean predicate blarneys, mi, m.2, s') indicates whether player i is "responsible" for 
the state change from s to s' when the moves mi and m2 are proposed. Denoting the 
opponent of player i by ~i = 3 — i, for i G {1, 2}, we define 



blarneys, (Ai, oi), (A 2 , a 2 ),s) 



(Ai < A 2 A 5(s,(A 1 ,a 1 )) = s') A («l ± U) if i = 1 
(A 2 < Ai A 5(s, (A 2 , a 2 }) = s') V/ ifli = -U) if » = 2 



Runs. A ran r = sq> { m i> m 2)' s i; ( m ii m 2); • • • of the timed game structure 9 is an infinite 
sequence such that £ S and m^ G T^Sfc) and s^+i G <5jd(sfc, mf, m^) for all k > and 
i G {1,2}. For k > 0, let time(r, fc) denote the "time" at position k of the run, namely, 
time(r, A;) = Y^jZo delay(m{, m ] 2 ) (we let time(r, 0) = 0). By r[k] we denote the (k + l)-th 
state Sfc of r. The run prefix r[0..fc] is the finite prefix of the run r that ends in the state s^. 
Let Runs be the set of all runs of 9, and let FinRuns be the set of run prefixes. 
Objectives. An objective for the timed game structure 9 is a set $ C Runs of runs. We 
will be interested in parity objectives. Parity objectives are canonical forms for w-regular 
properties that can express all commonly used specifications that arise in verification. 



*We illustrate the usefulness of _L* in Example 12.21 The results of this paper do not change if _L* is not 
present in the framework. 
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Let Q : S h4 {0, . . . , A;— 1} be a parity index function. The parity objective for Q requires 
that the maximal index visited infinitely often be even. Formally, let Inf0ften(f2(r)) denote 
the set of indices visited infinitely often along a run r. Then the parity objective defines 
the following set of runs: Parity(f2) = {r | max(lnfOften(f2(r))) is even }. A timed game 
structure 9 together with the index function Q constitute a parity timed game (of order k) 
in which the objective of player 1 is Parity . 

Strategies. A strategy for a player is a recipe that specifies how to extend a run. Formally, 
a strategy 7Tj for player i G {1,2} is a function 7Tj that assigns to every run prefix r[0../c] a 
move m, in the set of moves available to player i at the state r[k]. For i G {1,2}, let IT 
be the set of strategies for player i. Given two strategies tt\ G IT and 1x2 G II2, the set of 
possible outcomes of the game starting from a state s G S is the set of possible runs denoted 
by Outcomes(s, 7Ti, 7T2). 

Receptive strategies. We will be interested in strategies that are meaningful (in the 
sense that they do not block time). To define them formally we first present the following 
two sets of runs. 

• A run r is time-divergent if lim/^oo time(r, k) = 00. We denote by Timediv the set of all 
time-divergent runs. 

• The set Blameless^ C Runs consists of the set of runs in which player i is responsible 
only for finitely many transitions. A run s$, (m®, m^), s±, (m\, m\), . . . belongs to the 
set Blamelessj, for i = {1,2}, if there exists a k > such that for all j > k, we have 
-1 blamej(sj, m\, m J 2 , Sj + \). 

A strategy 7Tj is receptive if for all strategies ir^i, all states s G S, and all runs r G 
Outcomes(s, 7Ti, 7T2), either r G Timediv or r G Blameless^. Thus, no matter what the 
opponent does, a receptive strategy of player i cannot be responsible for blocking time. 
Strategies that are not receptive are not physically meaningful. A timed game structure 
9 is well-formed if both players have receptive strategies. We restrict our attention to 
well- formed timed game structures. We denote 11^ to be the set of receptive strategies for 
player i. Note that for tt\ G Ilf,^ G II^, we have Outcomes(s, tti, 7^) C Timediv. 

Winning sets. Given an objective let WinTimeDiv^($) denote the set of states s in 9 
such that player 1 has a receptive strategy tti G 11^ such that for all receptive strategies 
7T2 G 11^, we have Outcomes(s, m, 1T2) Q The strategy iri is said to be a winning strategy. 
In computing the winning sets, we shall quantify over all strategies, but modify the objective 
to take care of time divergence. Given an objective let 

TimeDivBli(<I>) = (TimedivR $) U (Blamelessi \ Timediv) 

i.e., TimeDivBli(<I>) denotes the set of runs such that either time diverges and $ holds, or 
else time converges and player 1 is not responsible for time to converge. Let WinS($) be 
the set of states in 9 such that for all s G Win^^), player 1 has a (possibly non-receptive) 
strategy tti G H\ such that for all (possibly non-receptive) strategies iT2 G II2, we have 
Outcomes(s, tti, ^2) ^ The strategy n\ is said to be winning for the non-receptive game. 
The following result establishes the connection between Win and WinTimeDiv sets. 

Theorem 2.1 ([HP06]). For all well-formed timed game structures 9, and for all uj -regular 
objectives <I>, we have Win^(TimeDivBli(<I>)) = WinTimeDiv^($). 

We now define a special class of timed game structures, namely, timed automaton 
games. 
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Timed automaton games. Timed automata [AD9 4] suggest a finite syntax for spec- 
ifying infinite-state timed game structures. A timed automaton game is a tuple 7 = 
(L, C, A\, A2,E, 7) with the following components: 

• L is a finite set of locations. 

• C is a finite set of clocks. 

• A\ and A<i are two disjoint sets of actions for players 1 and 2, respectively. 

• BCLx (Ax U A 2 ) x Constr(C) x L x 2 C is the edge relation, where the set Constr(C) of 
clock constraints is generated by the grammar 

9::=x<d\d<x\^9\8 1 Ae 2 

for clock variables x E C and nonnegative integer constants d. 

For an edge e = (l,ai,Q,l',X), the clock constraint 6 acts as a guard on the clock values 
which specifies when the edge e can be taken, and by taking the edge e, the clocks in the set 
A C C are reset to 0. We require that for all edges {l,<n,0',l',\') + (I, a[, 6", I", A") G E, 
we have 7^ a\. This requirement ensures that a state and a move together uniquely 
determine a successor state. 

• 7 : L 1— > Constr(C) is a function that assigns to every location an invariant for both 
players. All clocks increase uniformly at the same rate. When at location I, each player i 
must propose a move out of I before the invariant j(l) expires. Thus, the game can stay 
at a location only as long as the invariant is satisfied by the clock values. 

A clock valuation is a function k : C 1— > !R>o that maps every clock to a nonnegative real. 
The set of all clock valuations for C is denoted by K(C). Given a clock valuation k E K(C) 
and a time delay A E lR>o> we write k + A for the clock valuation in K(C) defined by 
(k + A)(x) = k(x) + A for all clocks x E C. For a subset A C C of the clocks, we write 
k[X := 0] for the clock valuation in K(C) defined by (k[A := 0])(x) = if x E A, and 
(k[A := 0])(x) = k(x) if x A. A clock valuation k E K(C) satisfies the clock constraint 
6 E Constr(C), written k \= 9, if the condition 8 holds when all clocks in C take on the 
values specified by k. A state s = {I, k) of the timed automaton game 7 is a location / E L 
together with a clock valuation k E K(C) such that the invariant at the location is satisfied, 
that is, k (= 7(7). We let S be the set of all states of 7. 

In a state, each player i proposes a time delay allowed by the invariant map 7, together 
either with the action _Lj (player 1 can also propose _L*), or with an action ai E Ai such 
that an edge labeled a>i is enabled after the proposed time delay. We require that for all 
states s = (1,k), either 

(a) k + A \= 7(Z) for all A E JR> , or 

(b) there exist a time delay A E IR>o and an edge (1,02,0,1' , A) E E such that 

(1) U2 E A 2 and 

(2) k + A\=6 and for all < A' < A, we have k + A' \= <y(l), and 

(3) (« + A)[A:=0] |=7(0- 

Informally, these conditions ensure that, for a legal state, either the invariant at the location 
is satisfied at all time points in the future; or there is some time point in the future at which 
a discrete action can be taken by the plant (with the location invariant being satisfied up 
to that time point). This requirement is necessary (but not sufficient) for well-formedness 
of the game. 

The timed automaton game 7 defines a timed game structure [T| = (S, A±, A2, Ti,T2,S) 
as follows: 
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f r 

x > A 
o* 



a < A —¥ ci2 

Figure 2: Player- 1 actions disabled for A time 
units (X* absent in model). 



x < A — > a2 

Figure 3: Player- 1 actions disabled for A time 
units (J_* present in model). 

• 5 = {(I, k) I Z G L and k(Z) satisfies 7(Z)}. 

• For i 6 {1,2}, the set Ti({l,K.)) contains the following elements: 

(a) (A, _Lj) if for all < A' < A, we have k + A' \= 7(Z). 

(b) (A, if for all < A' < A, we have k + A' \= 7(Z), a, € Aj, and there exists an 
edge (Z, aj, 0, Z', \) E E such that k + A |= 9. 

(c) (0,-U) if i = 1. 

• The transition function 5 is specified by: 

(a) $(«,<A,-U)) = s 

(b) *((Z,«),<A,_L i )) = (Z,« + A). 

(c) 6((l,K),(A,ai)) = (1',{k + A) [A := 0]) for the unique edge (Z, a*, 0,/', A) G £ with 
k + A |= 0. 

The timed game structure [T] is not necessarily well-formed, because it may contain cy- 
cles along which time cannot diverge. Well-formedness of timed automaton games can 
be checked in EXPTIME [HP06]. We restrict our focus to well- formed timed automaton 
games in this paper. 

Example 2.2 (Utility of the action -L*). Suppose we did not have the action _L*. Then 
in every timed game, we need to require that from every location, there needs to be an 
outgoing player-1 edge before the invariant of the location expires; this requirement needs 
to be present as now player 1 cannot simply relinquish active control to player 2. 

Consider a timed automaton game 7 with a location I in which we want to model the 
following. We want that if a clock condition 9 is met at location I, then the moves of the 
controller (player 1) are disabled for the next A time units (the disabled time interval is 
right open); and that player 2 (the plant) is to take a particular action a' 2 within these 
A time units. Let x be a clock used just for guarding the A condition. The system can 
be modeled as in Figure [2j The incoming player-1 and player-2 edges to I 1 have the same 
guard and reset condition. Without the _U action, we need special player-1 and player-2 
actions a\ and a\ which go to a dummy sink accepting location from the location Jr. The 
objective $ of player 1 also needs to be modified to $ A (— iOZ*) (where O is the standard 
LTL operator for reachability). With the implicit presence of the relinquishing action 
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the situation can be modeled more naturally as in Figure [3] without having to add dummy 
sink locations and edges; or having to change the objective. Note that replicating the a' 2 
action to a similar player-1 action from l\ does not work in a _L*-less model, as we want the 
time at which the a' 2 action is taken to be under the control of player 2. □ 

Clock regions. Timed automaton games can be solved using a region construction from the 
theory of timed automata [AD94]. For a real t > 0, let frac(i) = t — \ t\ denote the fractional 
part of t. Given a timed automaton game T, for each clock x G C, let c x denote the largest 
integer constant that appears in any clock constraint involving x in 7 (let c x = 1 if there is no 
clock constraint involving x). Two states k\) and (li, Ki) are said to be region equivalent 
if all the following conditions are satisfied: (a) l\ = h, (b) for all clocks x, K\(x) < c x iff 
k>2(%) < c x , (c) for all clocks x with K\{x) < c x , [ni(x)\ = [k2(x)\, (d) for all clocks x,y 
with k\(x) < c x and Ki(y) < c y , hac(ni(x)) < frac(Ki(y)) iff frac(«2(a;)) < frac(K2(y)), 
and (e) for all clocks x with Ki(x) < c x , frac(«i(x)) = iff frac(«; 2 (a;)) = 0. A region is 
an equivalence class of states with respect to the region equivalence relation. There are 
finitely many clock regions; more precisely, the number of clock regions is bounded by 

\L\-IUec<P* + l)-\C\l'#W. 

Representing regions. A region of a timed automaton game T can be represented as a 
tuple R = {I, h,V(C)) where (a) I is a location of 7; (b) h is a function which specifies the 
integral part of clocks h : C — > (IN PI [0, M]) (M is the largest constant in T); and (c) V(C) 
is a ordered disjoint partition of the clocks (C_x, Co, . . . C n ) such that WCj = C, with C{ ^ 
for i > 0. Then, a state s with clock valuation k is in the region corresponding to R when 
all the following conditions hold: (a) the location of s corresponds to the location of R; 
(b) for all clocks x with k(x) < c x , [k(x)\ = h(x); (c) for k(x) > c x , h(x) = c x ; (d) for 
all pair of clocks (x,y), with k(x) < c x and n{y) < c y , we have frac(«(a;)) < frac(«(y)) iff 
x € Ci and y G Cj with < i < j (so, x,y G Ck with k > implies frac(K(x)) = frac(«;(y))); 
(e) for k(x) < c x , frac(«(x)) = iff x G Co; and (f) x G C_i iff k(x) > c x . 
Region strategies and objectives. For a state s G S, we write Reg(s) C S for the clock 
region containing s. For a run r, we let the region sequence Reg(r) = Reg(r [0]), Reg(r [1]), 
Two runs r, r' are region equivalent if their region sequences are the same. An w-regular ob- 
jective $ is a region objective if for all region-equivalent runs r, r', we have r G $ iff r' G <F A 
strategy tti is a region strategy, if for all runs ri and r2 and all A; > such that Reg(ri[0..fe]) = 
Reg(r 2 [0..fc]), we have that if 7Ti(ri[0..fe]) = (A,ai), then 7ri(r 2 [0..A;]) = (A',ai) with 
Reg(ri[/c] + A) = Reg(r2[/c] + A'). The definition for player 2 strategies is analogous. Two 
region strategies tv\ and ir[ are region-equivalent if for all runs r and all k > we have that 
if 7Ti(r[0..fe]) = (A,ai), then 7ri(r[0..A;]) = (A',ai) with Reg(r[/c] + A) = Reg(r[/c] + A'). A 
parity index function Q is a region (resp. location) parity index function if fl(si) = fi(s 2 ) 
whenever Reg(si) = Reg(s2) (resp. 81,82 have the same location). Henceforth, we shall 
restrict our attention to region and location objectives. 

Encoding time- divergence by enlarging the game structure. Given a timed au- 
tomaton game T, consider the enlarged game structure 7 (based mostly on the construction 
in [dAFH + 03] ) with the state space S T CSx EWi) x {true, false} 2 , and an augmented 

transition relation 5 7 : S 7 x (Mi U Mjj) h-> S 7 . In an augmented state (s,3, tick, bl\) G S" T , 
the component s G <S is a state of the original game structure [T], 3 is the value of a ficti- 
tious clock z which gets reset to every time it hits 1, tick is true iff z hit 1 during the last 
transition, and bl\ is true if player 1 is to blame for the last transition (i.e., blamei is true 
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for the last transition). Note that any strategy in in [T], can be considered a strategy in 7. 
The values of the clock z, tick and bl\ correspond to the values each player keeps in memory 
in constructing his strategy. Given any initial value of 3 = 3*, tick = tick* , bl\ = bl*; any 
run r in 7 has a corresponding unique run r in 7 with r[0] = (r[0],j*, tick*, bl*) such that r 
is a projection of r onto 7. For an objective we can now encode time-divergence as the 
objective: 

TimeDivBli($) = (DO tick -> $) A (->□<> izc/c -4 <>□-. 6Z1) 
where □ and O are the standard LTL modalities ("always" and "eventually" respectively), 
the combinations DO and On denoting "infinitely often" and "all but for a finite number 
of steps" respectively. This is formalized in the following proposition. 

Proposition 2.3. Let 7 be a timed automaton game and 7 be the corresponding enlarged 
game structure. Let $> be an objective on 7. Consider a runr = s°, (m°, m°), s , {m\,m^), . . . 
in 7. Let r denote the corresponding run in 7 such that 3 = 0, tick = false, bl° = false 
andr= (s°,3°, tick , bl }, (m^m ,), (s 1 ^ 1 , tick 1 , bl{), (m\,m\),.. . . 
Then, r G TimeDivBli($) iff r G ((DO tick ->• $) A (-.DO focifc ->• OD-i 6/1)). 

Proof. Time diverges in the run r iff it diverges in the corresponding run r. Moreover, time 
diverges in r iff time crosses integer boundaries infinitely often, i.e., □<> tick holds. Also, the 
run r belongs to Blamelessi iff player 1 is blamed only finitely often, i.e., <>□-> bl\ holds. □ 

The following lemma states that because of the correspondence between 7 and T, we 
can obtain the winning sets of T by obtaining the winning sets in 7. 

Lemma 2.4. Let 7 be a timed automaton game and 7 be the corresponding enlarged game 
structure. Let $ be an objective on 7. Any state s of 7 satisfies s € WinJ(TimeDivBli(<I>)) 

iff (s,0, false, false) g Winf ((DO tick ->• $) A (-.DO tick ->■<>□-. bh)). 

Proof. Consider a state s of 7, and a corresponding state (s, 0, false, false) of T. The 
variables 3, tick and bl\ only "observe" properties in T, they do not restrict transitions. 
Thus, given a run r of T from s, there is a unique run r of 7 from (s, 0, FALSE, FALSE) and 
vice versa. Similarly, any player-i strategy 7Tj in 7 corresponds to a strategy 7r j in T; and 
any strategy 7Tj in T corresponds to a strategy Hi in T such that both strategies propose the 
same moves for corresponding runs. The result then follows from Proposition 12.31 □ 

Encoding TimeDivBli(Parity(fi)) as a parity objective. If $ is a parity objective, then 
TimeDivBli(<I>) can be specified as a parity objective in a related game structure T^ 1 . The 
following encoding is based on a construction in [dAFH+03] . Given $ = Parity (0) where 

is a parity index function of order d, the structure T^ td has the state space S*^" C 
S x IR[o.i) x {true, false} 2 x {0, 1, . . . , d — 1}. Given a state s = (5,3, tick, bl\,p), the set 

of available moves Tj n (s) is equal to Tj(s). The transition relation is specified as 

follows. For (A, Oj) G Fj(s), we have S^ n ((s,$, tick, bl\,p), (A, a^)) = (s 1 ,3' ,tick! , bl' x ,p') 
where 

• s' = 5 7 (s, (A, a*)). 

• 3' = frac(3 + A). 

• tacA;' = true iff 3 + A > 1. 

• bl[ = true iff i = 1 (i.e., its a player-1 move). 
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P' 



max(p, 0(s')) if tick = false 
ft(s') if tick = TRUE 

The following lemma states that a parity index function f2 td can be defined on 7^ td 
such that the time divergence conditions are encoded. The proof of the lemma is technical 
and is presented in the appendix. 

Lemma 2.5. Let 7 be a timed automaton game, Parity(f2) an objective on 7, and 7^ td 
the corresponding enlarged game structure. Consider the parity index function J7 td for 7 ntd 
defined as 

{0 if tick = bl\ = false 
1 if tick = FALSE, Mi = TRUE 

p + 2 if tick = true 

Extend the parity function $7 to states of 7^ td , such that the parity Q((s,i,tick,bli,p)) is 
the same as the parity O(s) in 7. Then in the game structure 7^ td , we have 

TimeDivBli(Parity(T2)) = ((DO tick ->■ Parity(Sl)) A (->□<> tick -> OCH 6^)) = Parity(ft td ). 

□ 

The next lemma states that we can consider games on T^ td with the parity index 
function Q td to obtain the winning states of 7 for the objective Parity (Q). The proof of the 
lemma follows from the results of Lemma 12.41 (the result of the lemma also holds for the 
structure T^ td ) and Lemma [ 



Lemma 2.6. Let 7 be a timed automaton game, £1 a parity index function on states 
of 7, 7^ c the corresponding enlarged game structure, and Q td the parity index function 
on states of 7^ td defined in Lemma Iff, 51 Let $7 be extended to states of 7^ td such that 
tick, bl\,p)) = f2(s). Any state sof7 satisfies s G Win^(TimeDivBli(Parity(f2))) 

iff (s, 0, FALSE, FALSE, 0) G Winf 2 ((DO tick -> Parity(^)) A (-.DO tick OCH bh)) 

iff (s, 0, false, false, 0) G Win^ n (Parity(ft td )). □ 

Let ac be a valuation for the clocks in C = Cu{z}. A state of T^ td can then be considered 
as tick, bl\,p). We extend the clock equivalence relation to these expanded states: 

({l,K),tick,bh,p) = ((V ',2'), tick' ,bl[,p') iff / = I', tick = tick! ,bl x = bl[,p = p' and k = k'. 
We let {I, tick, bl x ,p) be the "locations" in . If Q is a location parity index function for 
7, we have O td to be a location parity index function for 7^ 

A //-calculus formulation for describing the winning set. A //-calculus formula ip 
to describe the winning set Win[ (Parity(O td )) is given in [dA FH+03| . The //-calculus 
formula uses the controllable predecessor operator for player 1, CPrei : 2 s i— > 2 s (where 
S = S^" ), defined formally by 



s G CPrei(Z) iff 3mi G T{ (?) Vm 2 G (s) . 5j d (s, m l ,m 2 ) C Z. 

Informally, CPrei (Z) consists of the set of states from which player 1 can ensure that the 
next state will be in Z, no matter what player 2 does. The operator CPrei preserves regions 



of T^ td (this follows from the results of Lemma 13. 1 [) . It follows from dAFH + 03 that given 



TIMED PARITY GAMES: COMPLEXITY AND ROBUSTNESS 



13 



a parity index function Q : S t- > {0, 1, . . . , d— 1}, the winning set Win[ (Parity(O)) can be 
described by the following //-calculus formula in case d — 1 is odd: 

Q- l (0) nCPrei(Yo) 



U 



fi-^nCPreipTi) 

u 

^- 1 (2)nCPrei(y 2 ) 



fi- 1 ((i-2)nCPrei(y (i _ 2 ) 

u 

n _1 (d- 1) nCPrei(X d _i) 

where ^ and f denote the least fixpoint and the greatest fixpoint operators respectively, and 
denotes the set of states of parity j. In case d — 1 is even, the /i-calculus formula is 

n~ l (Q) nCPrei(Yo) 

u 



vY d _i ■ nX d _ 2 ■ vX d _ 3 . . . iiY 2 ■ vX x ■ hYq 



fi-^^nCPre^Xi) 

u 

^- 1 (2)nCPrei(y 2 ) 

0- 1 (d-2)nCPrei(X d _ 2 ) 
u 

O-^d-^nCPre^Fd-O 

We now present a lemma which states that in the structure T^ td , for location w-regular 
objectives, (1) memoryless region strategies suffice for winning, and (2) from states in the 
winning set there exists a winning memoryless region strategy tt\ such that all strategies 
region-equivalent to tt\ are also winning. The proof of the lemma can be found in the 
appendix. 

Lemma 2.7. Let 7 be a timed automaton game, £1 a location parity index function on states 
of 7, and 7 n the corresponding enlarged game structure with the parity index function 
Q td . Then, (1) there exists a memoryless region winning strategy tt\ for Parity(Sl td ) from 

Win 



(Parity(ft td )), and (2) if tx'y is a strategy that is region-equivalent to then tt[ is a 
winning strategy for Parity (f2 td ) from W\n{ (Parity(f2 td )). □ 

We say a strategy 7Tj is move-independent if for any two runs r, r' such that r[k] = r'[k] 
for all k > we have 7r.;(r[0..j]) = ir(r'[0..j]) for all j > 0. A move-independent region 
strategy is a region strategy that is move- independent. The following corollary follows 
from Lemma 12.71 observing that a memoryless strategy is a move independent strategy. A 
memoryless strategy in does not always have a corresponding memoryless strategy in 7. 
It may not even have a move-independent strategy in 7. This is because to infer the values 
of 3, tick, bl\ in 7, we need the values of the previous moves taken. The following proposition 
states that if player 1 has access to a global clock then move-independent strategies suffice 
in T (the proof can be found in the appendix). 
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Proposition 2.8. Let 7 be a timed automaton game such that the set of clocks includes a 
global clock z that is never reset and let <I> = Parity (SI) be an oj-regular location objective of 
7. Then, move-independent strategies in 7 suffice for winning TimeDivBli^). □ 

The next lemma states that memoryless strategies of player 2 suffice as spoiling strate- 
gies (the proof is in the appendix). 

Lemma 2.9. Let 7 be a timed automaton game, a location parity index function on states 
of 7, and T^ 1 the corresponding enlarged game structure with the parity index function fi td . 
Then, memoryless region strategies of player 2 in T^ td suffice for preventing player 1 from 

winning Parity(S7 td ) from a state s £ Wm{ (Parity(Sl td )) . □ 



3. Exact Winning of Timed Parity Games 

In this section we will present a reduction of infinite-state timed automaton games with 
parity objectives to finite-state turn-based games with parity objectives. The reduction 
gives us several results related to complexity and algorithms to solve timed automata parity 
games: (a) we obtain algorithms to solve timed automaton parity games with better time 
complexity than the algorithm presented in |dAFH + 03j ; (b) our reduction allows us to use 
the rich literature on algorithms for finite-state parity games for solving timed automaton 
parity games. 

Finite-state turn-based games. A finite-state turn-based game G consists of the tuple 
((S, E), (Si, S2)}, where (Si, S2) forms a partition of the finite set S of states, E is the set 
of edges, Si is the set of states from which only player 1 can make a move to choose an 
outgoing edge, and S2 is the set of states from which only player 2 can make a move. The 
game is bipartite if every outgoing edge from a player-1 state leads to a player-2 state and 
vice- versa. 

First idea of the reduction. Let 7 be a timed automaton game, and let 7° be the 

corresponding enlarged timed game structure that encodes time divergence. We shall con- 
struct a finite-state turn-based game structure 7? based on the regions of 7^ which can 
be used to compute winning states for parity objectives for the timed automaton game T. 
In this finite-state game, first player 1 proposes a destination region R± together with a 
discrete action a± (intuitively, this can be taken to mean that in the game T^ 1 , player 1 
wants to first let time elapse to get to the region R\, and then take the discrete action a\). 
The finite-state game then moves to an intermediate state which remembers the proposed 
action of player 1 in the game T^ 1 . Let us denote this intermediate state in 7? which spec- 
ifies the desired destination region and action of player 1 in T n by the tuple (R, Ri,ai). 
From this state in 7* , player 2 similarly also proposes a move consisting of a region R2 
together with a discrete action 02- These two moves in 7* signify that in the game in 
player i proposed a move (Aj, Oi) from a state s € R such that s + Aj 6 Depending on 
the move (A2,a2), the game in T* will then proceed from (R,R\,a\) to destination states 

Reg(<r (s, (Ai, a\)) J , or Reg (5 (s, (A2, 02)) J , or both, after the move of player 2 
depending on whether Ai < A2, or, Ai > A2, or, Ai = A2, respectively. The following 
lemma indicates that only the regions of s + Aj are important in determining whether the 
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move of player 1 or player 2 determines the successors after this two-step process. The proof 
is technical and is presented in the appendix. 

Lemma 3.1. Let 7 be a timed automaton game, Q a parity index function on states of 7 
and let Y, Y/, Y 2 be regions in the enlarged timed game structure 7^ . Suppose player-i has 
a move (Ai, J_j) from some s G Y to s"j G Y- , for i G {1, 2}. Then, for all states s G Y and 
/or a// player-1 moves mf = (Ai,ai) s + Ai G Y/ and ai ^ one of the following 
cases must hold. 

(1) Y{ ^ Y 2 ' and for all moves m 2 = (A2,a2) of player- 2 with s + A2 € Y 2 ' ; we have A± < 
A2 (and hence blamei(s", mf , m|, 5(s~, mf )) = true and blame2(?, mf , m|, <5(s, m|)) = 

FALSE,). 

(2) Y/ 7^ Y 2 ' and /or all player-2 moves m 2 = (A2,a2) with s + A2 G Y 2; we /love A2 < 
Ai (and hence blame2(?, mf , mf, 5(s", mf)) = true and blarneys', mf , mf, <5(s, mf )) = 

FALSE,). 

(3) Y[ = Y 2 and there exists a player 2 move mf = (A2, 02} with s"+ A2 G Y 2 sucre i/iai Ai = 
A2 (and hence blamei(?, mf , m|, 5 (J, mf )) = true and blame2(s', mf , mf, £(?, mf)) = 

TRUE ). □ 

Lemma 13.11 states that given an initial state in 12, for moves of both players to some fixed 
i?i,i?2j either the move of player 1 is always chosen, or player 2 can always pick a move 
such that player-1 's move is foiled. 

Let S^T = {X I X is a region of T^ d }, and let = S^T x x A±. Using 

Lemma I37T] a bipartite turn-based finite game = {(S? , E?), (S^ g x {1}, x {2})) can 
be constructed to capture the timed game T as follows. 

• The state space Sf is equal to SfQ x {1} U & x {2}. 

• Each (12, 1} G S^" x {1} encodes states in the timed game T° td that belongs to the 



region R. Sp"" x {1} are player-1 states. 

• Each (Y, 2) G S 1 ' x {2} encodes the following information: (a) the previous state of 7* 
(which corresponds to a region R of 7 n ), (b) a region 12' of (representing an 
intermediate state which results from time passage in T° from the state in the previous 
region R to a state in R'), and (c) the desired discrete action of player 1 to be taken from 
the intermediate state in R'. x {2} are player-2 states. 

• An edge from (12,1) to (Y, 2) = (R, R' ,a±,2) would represent the fact that in the timed 
game T^ td , from some state s G R, player 1 has a move (A, ax) such that s + A is in 
the intermediate region component 12' of (Y, 2), with a% being the desired final discrete 

action. From the state (Y, 2), player 2 would have moves to S Reg x {1} depending on 

what moves of player 2 in the timed game T^ td can beat the player-1 moves from 12 to 

12' according to Lemma 13.11 
The construction alluded to above requires having a state space that has size roughly 
I^Reg | 2, l-^i 1 1 w here \S"^ g j is the number of regions of T^ td . An optimized construction was 
presented in [CHP08 such that the size of the state space was roughly 0(|<Sp |-M-|C|-|Ai|) 
where M is the largest constant and C the set of clocks in A^ td . The optimization was due 
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to the observation that a state in a region R does not have transitions to every region, but 
only to a few restricted ones. 

Second idea of the reduction. We will show in subsection 13.21 that the strategies of 
both players in the timed game can be be restricted so that from any state s, each 
player can only propose moves (Aj,aj) such that the discrete action &j is taken either from 
the current region, or from the two following successor regions. That is, the cardinality of 
the set {Reg(s + A) | < A < Aj} is at most 3. The winning set remains the set with 
strategies being restricted in this manner. This result allows us to restrict the size of the 

state space in the region pair construction mentioned above to 0(\S^ \ ■ \Ai\). A further 

Sn td 

optimization allows us to have a state space linear in |Sr |. 

Outline of Section [3j In subsection 13 .1\ we show that the strategies of both players can be 
restricted so that the edges of the timed automaton game are taken from the current region, 
or from the two following successor regions; with the restriction not changing the winning 

set Wir^ (Parity(il td )). We call these restricted games 3-region timed parity games. Then 
in subsection 13. 2^ we reduce these 3-region timed parity games to finite-state turn-based 
games, the state space of the turn-based game being linear in the number of regions of CT" . 



3.1. Reduction to 3-Region Timed Parity Games. 



rfn td ffn td 

3.1.1. 3-Region strategies. We define the boolean functions Succr - : S J x IR> h-> 
{true, false} for j G {2, 3} as 

Succr^f? A) = / TRUE if l{ Re S( ? + A') | < A' < A}| < j 
i ^ ' ' \ false otherwise 

For player i, with i G {1, 2}, we say a strategy 7Tj is a 3-region strategy of T^ td if for any run 
prefix r[0..fc], we have that 7Tj plays a move of duration A such that r[k] + A is at most in the 

second following region. Formally, 7Tj(r[0..fc]) = (A^,af) with Succr^ (r[k],Af) = true. 
We wish to show that in the game T^ td , we can restrict both players to using only 3-region 
strategies. Consider 3-region strategies of player 1. Informally, they suffice for winning as 
a) 3-region strategies allow time to diverge, and b) if iri is a player-1 winning strategy, then 
we can obtain a winning 3-region strategy tt* that plays the same moves as ix\ whenever m 
proposes moves within two successor regions; and plays simple time moves to the second suc- 
cessor region whenever tti plays a move to outside the second following region. The strategy 
ttI works in general because for a run r[0../c] such that tt\ proposes moves outside two succes- 
sor regions, a player-2 strategy iT2 can counter n\ by playing similar pure time move as ir\. 
Unfortunately this argument does not formally work, as a player-1 move makes the bl\ com- 
ponent true, and a player-2 move makes bl\ false, that is, 5(r[k], (A, J-i)) ^ S(r[k], (A, J-2)), 
the only difference being in the bl\ components. We get around this roadblock by working 
in another expanded game structure where the bl\ component is true only if a move of 
player 1 is chosen, and the move is either to the originating region, or to the immediately 
succeeding region. It turns out that this modification does not change the time divergence 
condition. We present this new game structure next. 
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3.1.2. The expanded game structure 7^ . Analogous to the definition of T^ td , let 7^m be a 

similar enlarged game structure, the only difference from T^ td being in the bl\ component 

(we will refer to the new bl\ component in the new game structure as 3bl). We denote the 

??n td f ntd 
transition relation by S 3&z ; and the joint transition relation by 5^ bl . In an augmented 

state {s,i,tick,3bl\,p) £ S aw , the component 3bli is true only if both of the following 
conditions are satisfied: 

(a) player 1 is to blame for the last transition, and 

if ^3u{{ S *, i *,Uck*,3bl* 1 ,p*),(A,a 1 )) = (s, h tick,3bl 1 ,p) 

then Succr 2 3 " ((s*,f, tick*, 3U* x ,p), A) = TRUE. 

A run r of 7 has corresponding unique run r in 7^ and r% in 7^, such that r is a projection 
of r and fa onto 7, given the starting values of 3, tick, bl\ , 3bl\ and p. The reverse also holds 
- for any run in the expanded game structures starting from (5,3, tick, bl\,p), we have a 
corresponding unique run in T from s. Observe that the available moves are the same in all 
game structures, and we may view the additional components of the states in the expanded 
game structures as being kept in memory by the two players in 7. A similar correspondence 
between strategies also holds, thus a strategy 7Tj in 7 has corresponding matching strategies 
in 7 Qtd and in 7^; and vice versa. 

The next lemma states that time diverges in a run f of 7^ when the run has infinitely 
many winning moves such that the moves allow a time elapse to a region farther than the 
immediate successor region. 

Lemma 3.2. Consider a run r £ 7 such that r = sq, (wi§, ttjSj), Si, (m\, m^}, ■ ■ ■ and the 
corresponding runs r and r 3 in 7^ td and 7^^ respectively. Suppose for infinitely many k 

we have Succr 2 {r[k], delay(m^, m 2 )) to be false (or Succr 2 (r^[k], delay (m^m^)) to be 
false). Then time diverges in the run r (and hence also in r,r%). 

Proof. We have that for all k > 0, except for the blame component, the remaining four 
corresponding components of the 5-tuples r[k] and of f~3 [k] match. Thus, 

Succrf" (r[/c],delay(mi,m 2 )) = Succrf" (r 3 [k], delay(mf', m 2 )). 

A region of 7^ td can be represented as a tuple R = {I, tick, bli,p,h,V(C)) (similar to 
clock regions in 7) where (a) h is a function which specifies the integer values of clocks 
h : C — > (IN n [0,M]) (M is the largest constant in 7); and (b) V(C) is a disjoint partition 

of the clocks (C_i, Co, . . . C n ) such that l+JCj = C , and Cj ^ for i > (see Section [2] for 
details on regions and this representation). 

Consider the clock partitions V k {C) = {C^, Cq, . . . CK) of the regions Reg(?[£;]). Sup- 
pose C^i ^ 0. Then, the immediate time successor of the region will have C_i = 0, and 
Cj = Cji! for n k + 1 > j > 0. Suppose C k 1 = 0, then the immediate time successor 
of the region will have C_i ^ 0; this happens because all the clock values in C k k reach 

an integer boundary. Suppose Succr 2 {r[k], delay(mj, m 2 )) = false. This means that 
Reg(f[fc] + delay(m^, m 2 )) is at least two region successors away from Reg(f[£;]). Thus, 
some clock must be crossing an integer boundary (greater than 0) between time(r, k) and 
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time(r, k + 1). Since Succr 2 (?{&], delay (mj, m 2 )) = false for infinitely many k, we must 
have that some clock crosses an integer boundary greater than 1 infinitely often. Observing 
that at least one time unit must pass between any two such crossings, we have that time 
diverges in the run r (and hence also in r, P3). □ 

We next show that the change in the bl\ component in 7^ does not change the 
time divergence condition. Given an objective <£, let TimeDiv3Bli(<3?) denote the objective 
(□O tick ->•$) A (-.DO tick -> OHH3&Zi). 

Lemma 3.3. Let 7 be a timed automaton game, Q a location parity index function on 
states of 7, and 7^ the corresponding enlarged game. Consider a run r G T and the 
corresponding runs r and r 3 in "y 1 and 7^ bl respectively. The run r belongs to the objective 
TimeDivBli(Parity(fi)) = (DO tick Parity(Sl)) A (-.DO tick ->■ On-, bh) iff the run r 3 
belongs to the objective TimeDiv3Bli(Parity(f2)) = (DO tick — > Parity(Sl)) A (->□<> tick — > 
OO^Sbh). 

Proof. We have that for all k > 0, except for the blame component, the remaining four 

corresponding components of the 5-tuples r[k] and of r 3 [k] match. Hence the membership 

of the run r in the objective TimeDivBli(Parity(ri)) differs from the membership of the run 

f~3 in the objective TimeDiv3Bli (Parity (Q)) only when both of the following conditions hold: 

(a) time converges on both runs (i.e., (DO tick) = false), and (b) bl\ is true infinitely often 

in r and 3bl 1 is true only finitely often in r 3 ; or vice versa. Observe that as the first four 

components of r[k] and in r 3 [k] match, and both runs correspond to a run r in 7, we have 

bl\ to be true whenever 3bl\ is true. Thus, we cannot have 3bl\ to be true infinitely often in 

r 3 and bl\ to be true only finitely often in r. Thus, we can restrict our attention to the case 

where bl\ is true infinitely often, but 3>bl\ is only true finitely often. The bl\ component 

of r[k] differs from the 2>bl\ component in r 3 [k] for k > 1 only when a move (A fe_1 , a^ 1 ) 

of player 1 is chosen from the state r[k — 1] (and correspondingly from r[k] and 7*3 [A;]), 

jn td jn td 
and we have Succr 2 3 " (r 3 [k — l],A fc_1 ) = false (note that Succr 2 3 " (r 3 [k — l],A fc_1 ) = 

Succr 2 (r[k — 1], A fc_1 )). Thus, we must have that player-1 moves (A fc ,a^) were chosen 

f?n td 

from the state r 3 [k] such that Succr 2 (?3[&;],A fc ) = false for infinitely many k. Thus, 
from Lemma 13.21 we have the runs r 3 and r to be time divergent. Hence, the membership 
of the run r in the objective TimeDivBli (Parity (il)) must be the same as the membership of 
the run r 3 in the objective TimeDiv3Bli(Parity($7)), □ 

As in Lemma 12.51 t ne objective TimeDiv3Bli(Parity(f2)) can be expressed as a parity 
objective Parity^^). Proposition 12.31 and Lemma 13.31 give us the following proposition 
which states that we can consider games in 7^ to compute winning sets in T. 

Proposition 3.4. Consider a timed automaton game 7 and a location parity objective Q on 
states of 7 with 7^ bl being the corresponding expanded game structure. Consider a run r = 
s°, (m®, m 2 ), s 1 , (m\, m 2 ), ... in 7. Let r 3 be the corresponding run in 7^ such that r 3 = 
(s°,3° , tick , 6/?,^°), (m?,m^), (s 1 , 3 1 , tick 1 , bl\,p l ),{m\, m\) with f = 0,tick° = false, 
bl\ = false, p° = 0. Then r G TimeDivBli(Parity(f2)) iffr 3 G TimeDiv3Bli(Parity(ft)) (i.e., 
? 3 e= Parity(Sl^), or equivalently, f 3 G (DO tick -)■ Parity(Sl)) A (->□<> tick -> OCH3&Zi). 

□ 
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7rJ(f3[0..fc]) 



3.1.3. Sufficiency of 3-region strategies. We next show that we can restrict both players to 
use only 3-region strategies in the game structure 7^ bl without changing the winning sets. 
We first consider 3-region strategies of player 1. The following lemma is a key lemma of the 
paper and an example is presented after the proof to help illustrate its workings. 

Lemma 3.5. Consider a timed automaton game 7 with Q a location parity function on 
states of 7 and 7^^ the corresponding expanded game structure. Then, 3-region memoryless 

T" td 

region strategies of player 1 suffice for winning from Wir^ 3 " (TimeDiv3Bli(Parity(f2))). 

Proof. As in Lemma 12.71 let m be a player-1 winning memoryless region strategy in 7^^ 

for TimeDiv3Bli(Parity(f2)) (the results of Lemma 12.71 also hold for the structure T^ d ). 
We construct a 3-region memoryless region strategy tt\ of player 1 which wins against 
all strategies of player 2. Intuitively, from a state s, the strategy ir\ prescribes (a) the 
same move as tt\ when tt\ prescribes a move to either the current region or to the two 
immediately succeeding regions (b) prescribes a relinquishing move _L* when n prescribes a 
relinquishing move, and (c) prescribes a time blocking move to the second region following 
Reg(s) whenever 7Ti proposes a move to outside the second succeeding region. Formally, let 
r 3 be any run in 7^ b \ . The strategy 7r* is specified by: 

n*2 td 

(A,oi) if 7Ti(f 3 [0..Jfc]) = (A,ai) and Succr 3 3W (r 3 [k],A) = true, 

and a\ 7^ _L* 
(0,J_») if 7ri(f 3 [0..ft]) = (A,_L,) 
(A, J_i) if 7ri(f 3 [0..fc]) = (Ai,ai), a\ ^ _L*, and 

Jf2 td 

Succr 3 3W (r 3 [A;],Ai) = false; where A is any real number 
such that \{Reg(? 3 [k] + A') | < A' < A}| = 3. 

Note that ir* is a memoryless region strategy as 7Ti is a memoryless region strategy. 

T fltd 

We claim n\ wins against all strategies of player 2 for every state s G Wirq 3 " . Sup- 
pose this is not true. Let tt% be a spoiling move-independent region strategy of player 2 

against ttI from s € Wiri]^ 3 " (move-independent region strategies suffice as spoiling strate- 
gies by a lemma corresponding to Lemma 12.91 for the structure T^t). Suppose r^ € 
Outcomes(is, 7r^, tt^) and f| ^ TimeDiv3Bli(Parity(r2)). We show that in that case we can 
construct a player-2 spoiling strategy for m, contrary to the assumption that tt\ was a 
winning strategy. Intuitively, given a finite run r 3 [0..fc] the strategy 7^: 

(1) Acts like 7r| when n\ proposes a relinquishing move. 

(2) Acts like tt| when tt\ proposes moves within three regions. 

(3) Acts like when tti proposes moves outside three regions and H2 proposes moves of 
shorter duration than tt\ (observe that 71"^ moves are pure time moves in case tt\ proposes 
moves outside three regions). 

(4) Proposes the same time delay moves as 7r* when tti proposes moves outside three regions, 
and 7T 2 proposes moves longer than 7rJ ; 

(5) Proposes the same moves as 7r| when n\ proposes moves outside three regions, tx\ 
proposes moves of exactly the same duration as tt*, and the move of tt^ is chosen at 

7%[k + l]. 
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(6) Proposes the same time delay moves as ir{ when 7Ti proposes moves outside three regions 
and when ir 2 proposes moves of exactly the same duration as ~k\ if the move of 7r* is 
chosen at r^[k + 1]. 

Formally, the player-2 strategy tt 2 (dependent on the strategies 7Ti,7r* and tt 2 ) is defined to 
be: 

" n* 2 {r 3 [Q..k]) if 7n(f 3 [0..fc]) = (A,oi), and 
a\ = _L* 

ir* 2 (r 3 [0..k]) if 7n(f 3 [0..fe]) = (A,oi), and 

?r« td 

Succr 3 (r 3 [k],A) = true 



7r 2 (? 3 [0..&]) 



and 



ir5(f 3 [0..fc]) if 7ri(f3[0..A;]) = (A, oi), oi + JL 

Succrg" (f 3 [/c], A) = false; 
7r;(f 3 [0..A;]) = (A',±i>, and 
7r*(? 3 [0..fc]) = (A*,aJ|) with A* < A' 

(A',i_ 2 ) if 7n(f 3 [0..fe]) = (A,ai),oi + T*, and 

Succr 3 (f 3 [fc], A) = false; 
7rI(r 3 [0..A;]) = (A',±i), and 
vr*(f 3 [0..fe]) = (A*, a* 2 ) with A* > A' 



ir* 2 (r 3 [0..k]) 



if 7ri(f 3 [0../c]) = (A,ai),oi ^= _L*, and 

Succr 3 (f 3 [/c], A) = false; 
7r;(f 3 [0..A;]) = (A',±i>, and 
vr|(f 3 [0../c]) = (A*, 02) with A* = A'; and 

(r* 3 [k] , ir* 2 (r* [0. .k] ) ) = 7% [k + 1] , and 

^ d (f5[fc], 7ri *(^[0..fc]))/f|[fc + l] 



<A',± 2 ) 



if 7Ti(f 3 [0..A;]) = (A,oi),ai / ±* 

Succr 3 " (f 3 [/c], A) = false; 
f 3 [0..fc]) = (A',_Li), and 



and 



7T 



vr|(f 3 [0../c]) = (A*, 4} with A* = A'; and 



Note that 7r 2 is a memoryless strategy 



as 7Tl , 7T 

J 3M 



1; 



%[k + l] 
ir 2 are all memoryless) 



We show that 



7r 2 is a spoiling strategy for 7Ti from s € Win 1 3W . This is a contradiction since tti was as- 
sumed to be a player-1 winning strategy. We show there exists a run f 3 € Outcomes^?, m, tt 2 ) 
such that r 3 [k] = r^[k] for all k > (recall that 73 is the run used in defining ir 2 , and is 
such that r% G Outcomes(s, 7r*, 712) and fg ^ TimeDiv3Bli(Parity($7))). 

We proceed by induction on k. For k = the claim is trivially true. Suppose the 
claim is true for all j < k. Thus, we have a run r 3 such that r 3 [j] = for all j < k. 
We show that the run r 3 [0..fc] can be extended to r 3 [0..k + 1] according to 7Ti, tt 2 such that 
f 3 [fc + 1] =73(^ + 1]. Informally, we have the following cases (we can ignore the moves taken 
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and only focus on the states in the runs 7-3 [0.. A;], which is the same as the states in the run 
r|[0..fe] since tt\ , tt2 , vr* , tt^ are all memoryless and hence move independent): 

(1) The strategies tt\ and tt\ propose a relinquishing move for the state sequence of ?3[0..fc] 
(which is the same as the state sequence 73 [0.. A;] by the inductive hypothesis). In this 
case the move proposed by player 2 is the same for 1x2 and for tt^, so the next state 
r^[k + 1] is the same as r^[k + 1]. 

(2) For the state sequence ?3[0..&;], the strategies 7Ti and 7rJ propose the same non-relinquish- 
ing move to either the current region, or the next two immediately succeeding regions. 
In this case the move proposed by player 2 is the same for TT2 and for tt|, so the next 
state r$\k + 1] is the same as r^[k + 1]. 

(3) The strategy iri proposes a move outside the second succeeding region, 7rJ proposes a 
delay move to the second succeeding region, and it\ proposes a move shorter than 7r*. 
In this case, the strategy tt2 proposes the same move as 7r|, and hence the same move 
of player 2 determines both r^[k + 1] and r^[k + 1]. Thus, the two states are equal. 

(4) The strategy %\ proposes a move outside the second succeeding region, 7r| proposes a 
delay move to the second succeeding region, and ir^ proposes a move longer than tt^. 
In this case, the strategy 1:2 proposes the same delay move as 7T*, and hence the state 
r^[k + 1] is the same as r^[k + 1] as both states are determined by delay moves of equal 
duration. The 3bl\ component remains FALSE in both r^[k + 1] and r%\k + 1] as the 
transition is outside the immediately succeeding region. 

(5) The strategy t\\ proposes a move outside the second succeeding region, 7rf proposes a 
delay move to the second succeeding region, ir^ proposes a move of exactly the same 
duration as 7rJ, and the move of player 2 according to the strategy tx\ determines 
r^[k + 1]. In this case 1x2 behaves like and hence the state 7*3 [A; + 1] is the same as 
f|[A; + 1] as both are determined by the same move of player 2. 

(6) The strategy t\\ proposes a move outside the second succeeding region, 7rf proposes a 
delay move to the second succeeding region, ir^ proposes a move of exactly the same 
duration as 7rJ, and the delay move of player 1 according to ir\ determines the next 
state r%\k + 1]. In this case iT2 proposes a delay move of the same duration as vr* and 
hence the state r^\k + 1] is the same as f|[fc + 1] as both are determined by delay moves 
of equal duration. The 3bl\ component remains FALSE in both 7*3 [k + 1] and r^[k + 1] 
as the transition is outside the immediately succeeding region. 

The full details can be found in the appendix. 

Thus, in all cases, we have that r^[0..k] can be extended to r3[0..k + 1] according to 
7ri,7T2 such that rz[0..k + 1] = f|[0..A; + 1]. Hence, we have r$ G Outcomes^, m, 1:2) and 
7^3 ^ TimeDiv3Bli(Parity(0)) as f| ^ TimeDiv3Bli(Parity(0)), a contradiction since -k\ was 
assumed to be a winning strategy. Hence, we cannot have the existence of the strategy 1^2 
from which and tt2 were derived, i.e., ir\ is a winning strategy for player 1 from s'. O 

The next example illustrates the above lemma and the usefulness of the structure 7^ . 

Example 3.6. Consider the timed automaton T in Figure HI Suppose the objective of 
player 1 is to reach the location I 1 starting from 1°. Player 1 controls only the edge a±. 
Player 2 controls the other edges a\ and a|. Player 1 wins from 1° so long as x < 4. Let 
r[0..fc] be a run such that a x < 4 where a x is the value of clock x in state r[k]. A winning 



22 



K. CHATTERJEE, T. A. HENZINGER, AND V. S. PRABHU 




Figure 4: A timed automaton game 7. 



strategy for player 1 is given by 

(r r 0J u )= (3-a*,ai> if a x < 3 
v 1 " lJ (0, a\) otherwise 

Based on m, a 3-region winning strategy 7r* can be obtained as in Lemma 13.51 Intuitively, 
from a state s, the strategy 7r| prescribes (a) the same move as 7Ti when tt\ prescribes a move 
to either the current region or to the two immediately succeeding regions, and (b) prescribes 
a time elapsing move to the second region following Reg(s) whenever 7Ti proposes a move 
to outside the second succeeding region. 

Let the initial state be so = (l°,z = x = y = 0, tick = false, 3bl\ = false) (we skip 
the parity component for simplicity). We consider a sample play from this state. Initially, 
7r^ proposes the move (1, _l_i) from sb- Suppose it is allowed by player 2. The resulting state 
is then si = (1°, z = 0,x = y = 1, tick = true, 36Zi = false). In the second step, tt\ again 
proposes the move (1, J_i) from si. the resulting state is s~2 = (1°, z = 0, x = y = 2, tick = 
true, 3bl\ = false). In the third step, 7r* proposes the move (1, a%) which is blocked by the 
player-2 move (0, a\), leading to the state S3 = (1°, z = 0, x = 0, y = 2, tick = false, 3bl\ = 
false). Player 2 can then take the action a\ a finite number of times, but eventually it 
must let time elapse. When it does, player 1 is able to take the action a± in three steps. 

Now we show that there exists a player-2 strategy 1x2 such that tti and 7T2 result in 

the same sequence of states sb, si, 3*2 > The strategy tt2 proposes a time elapsing move 

(1, 1-2 ) from so and then again from si. It then proposes (0, a^) from S2 as against 7r*, and 
then again (1, ±2) f° r ° ne more step. It can be verified that this leads to the same sequence 
of states as with ir*. 

Suppose we had been working in the structure T^ td . Then the states si and S2 would 
have had bl\ = true. Thus, in reaching I , the strategy 7r* would have resulted in some 
states having bl\ = true, in contrast to the strategy tv\ which manages to have bl\ = false 
until I 1 is reached. Thus iri and tt^ would on the surface appear to be inherently incompatible 
with respect to objectives. In this example however, it can be seen that we may as well 
work with 3bl\ as player-1 time-elapsing moves can be chosen only finitely often, thus 3bl\ 
and bl\ differ only finitely often. □ 

Next, we show that 3-region memoryless strategies of player 2 suffice as spoiling strate- 
gies. An example will be presented after the lemma to illustrate the sufficiency of 3-region 
strategies. 

Lemma 3.7. Consider a timed automaton game 7 and a location parity objective f2 on 
states of 7 with 7^ bl being the corresponding expanded game structure. Suppose s ^ 

Win 1 3W (TimeDiv3Bli(Parity(f2))). Then, 3-region memoryless region strategies of player 2 
suffice for preventing player 1 from satisfying the objective TimeDiv3Bli (Parity (O)) from s". 
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Proof. The objective TimeDiv3Bli(Parity(0)) corresponds to the parity objective Parity(S7 t ) 
by Lemma 12.51 (replacing bl\ by 3bl±). As mentioned in Section [21 there exists a game fjr 

f« td 

calculus formula which precisely characterizes the winning set Win^" (Parity(ft td )). The 
winning sets can be obtained by a ^-calculus iteration. The iteration uses the controllable 
predecessor operator for player 1, CPrei : 2 s i— >■ 2 s (where S is the state space of Ogy ), 

defined formally by s G CPrei(Z) iff 3mi G r i 3W (?) Vm 2 G T 2 3W (?) . <5 jd 3W (?, mi,ra 2 ) C Z. 
Informally, CPrei (Z) consists of the set of states from which player 1 can ensure that the 
next state will be in Z, no matter what player 2 does. It can be shown that CPrei preserves 
regions of 7^ bl using Lemma [3. 11 The iteration also suggests winning strategies for player 1 
based on the sets that arise in the iteration. The sets that arise depend on the parity 
labeling, the CPrei an d the fixpoint operators. If we have a location parity objective, it 
can be shown that only unions of regions arise as sets in the fixpoint iteration. To prove 
that 3-region memoryless region strategies of player 2 suffice as spoiling strategies, it hence 
suffices to show that the CPrei sets of unions of regions remains unchanged if we restrict 
player-2 strategies to be 3-region memoryless region strategies. 

Let CPre\^(Z) denote the set of states from which player 1 can ensure that the next 
state will be in Z, no matter what move player 2 takes within 3 regions. Also, clearly 
CPrei 5 3(Z) depends only on Z, and not on the history of the game (hence we shall have 
memoryless 3-region spoiling strategies). We show CPrei^Z) = CPrei(Z) for Z a union 
of regions. Clearly CPrei(Z) C CPrei^Z) as player 2 has fewer moves to counter with in 
CPrei 5 3(Z). To prove the other direction, we show if ? G CPrei^Z), then ? G CPrei(Z). 
We first characterize the CPrei sets. A state ? G CPrei (Z) iff either one of the following 
conditions is met: 

(1) {S(s, (A,a 2 » | (A,a 2 ) G r 2 (?)} C Z. 

(2) There exists (A,ai) G Ti(?) with a\ ^ _!_* such that 

(a) 5(s, (A, a\}) G Z, and 

(b) {S(s, (A',a 2 » | A < A, and <A',a 2 ) G T 2 (?)} C Z. 

The first condition corresponds to the case when player 1 proposes a move (A, J.*} from 
?. In this case, the move of player 2 will be chosen, no matter the move. Thus, we must 
have that no matter the move of player 2, the resultant state must be in Z. The second 
condition corresponds to the case when player 1 proposes a move (A,ai) from ? with 
ai ^ In this case, every move (A', a\) of player 2 with A' < A must lead to Z. A 
similar characterization exists for CPrei^Z), the only difference being that player-2 moves 
are restricted to be within 3 regions: a state ? G CPrei^Z) iff either one of the following 
conditions is met: 

(1) {6(s, (A,a 2 )) | (A,a 2 ) G T 2 (s) and Succr 3 (?, A) = true} C Z. 

(2) There exists (A,ai) G Ti(?) with a\ ^ _L* such that 

(a) 8{s, (A,ai)) G Z, and 

(b) {5(s, (A',o 2 » | A < A, Succr 3 (s*, A) = true, and (A,a 2 ) G T 2 (s)} C Z. 
Now we show that if ? G CPrei^Z), then ? G CPrei(Z). Suppose ? G CPrei j 3(Z). Infor- 
mally, we can have the following cases (the formal details are in the appendix): 

• The available moves of player 2 are only until the second immediately succeeding 
region. In this case, the restriction of player-2 strategies to be 3-region strategies 
has no effect, hence ? G CPrei (Z). 
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Figure 5: Regions of the timed automaton game T of Figure HI 



• Player 2 has a move from s to a state that is farther than the second succeeding 
region. This means that it has a delay move to the second succeeding region. In this 
case, player 1 can play the same delay move to the second succeeding region. The 
resulting state will be the same as due to the delay move of player 2 to the second 
succeeding region, because 3bh will be false in both cases. Moreover, all moves of 
player 2 which lie in the current or the next two regions result in a state in Z by 
assumption. In particular, the delay move of player 1 will also result in a state in 
Z. Thus, ?<E CPrei(Z). 
Thus, in all cases s£ CPrei(Z) whenever s£ CPrei^Z). □ 

Example 3.8. Consider the timed automaton game in Figure H] and the following regions. 

3, y, tick = true, 36Zi = false) | y > 2} 

y = 1, tick = true, 3bl\ = false)} 

, tick = FALSE, 3bl\ = false) I < X < 1} 

0, y = 1, tick = true, 3bh = false)} 

0, tick = false, 3bl\ = false) I < y < 1} 

y = 0, tick = false, 3bl± = false)} 

The regions can be seen as the thick lines and dots in Figure [5] (the clock z is not shown 
for simplicity) . Let Z = U j=o Rj ■ 

The state s = (1°, z = 0, x = y = 0, tick = false, 3bh = false) belongs to CPrei j3 (Z). 
This can be seen as follows. From state 'sq, player 1 proposes the move (3, ai). Player 2 can 
only propose a counter move (A, a*) such that < A < 1 and a* S {^2, 0-2}- Now observe 
that s"o also belongs to CPrei(Z). To see this, we have to consider a different player-1 move, 
namely (l,_l_i). If player 2 allows this move of player 1, the next state will be in R4 (the 
variable 3bl± remains false as the allowed move of player 1 is not in the immediate successor 
region of so)- If player 2 instead proposes a shorter (or equal) duration move, the next state 
will be in IJ j=o ^3 • 

Let Z* be any set of states which includes R§. Suppose so G CPrei^Z*) or so £ 
CPrei(Z*). Then it can be seen that Z* must contain the states from Z. Hence, we have 
so € CPrei ; 3(Z*) iff so £ CPrei(Z*). In general, 3-region strategies of player 2 suffice as if 
there exists a move of player 2 beyond the two immediately succeeding regions, then both 
player 1 and player 2 have simple time passage moves to the second succeeding region, with 
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the variable 3bli remaining false if any of these two moves is chosen. To mimic a general 
player-2 counter move, a 3-region strategy of player 2 can play simple time passage moves 
to the second successor regions till it gets to a state from which a desired player-2 edge lies 
within two successor regions. □ 

Given an objective Parity(f2), let Win 1 ' 3 (Parity ($7)) denote the states in the expanded 

game structure T3 such that for all states s € Win^'' r3 (Parity(r2)), player 1 has a 3-region 
strategy m such that for all 3-region strategies 7T2 of player 2, we have Outcomesfs, m, 1^2) Q 
Parity . The following theorem, which follows from Lemmas 13.51 13.71 and 13.31 states that 
to solve for Time Div3Bli(Parity(ft)) in , we can restrict both players to use only 3-region 
strategies. We call such games 3-region parity games. 

Theorem 3.9. Consider a timed automaton game 7 and a location parity objective Q on 
states of 7 with 7^ bl being the corresponding expanded game structure. We have 

WinJ' 3W (TimeDiv3Bli(Parity(ft))) = Win^ 3W (TimeDiv3Bli(Parity(ft))) 

ffo td 

= Win^ 3 " (TimeDivBli(Parity(fi))). 

□ 



3.2. Reduction from 3-Region Parity Games to Finite-State Turn-Based Parity 
Games. We gave a rough construction of a bipartite turn-based game to capture the win- 
ning sets of timed parity games at the beginning of this section . The idea of the construction 
being that first player 1 proposes a move representing its intention to let time pass to get 
to an intermediate region of 7^ bl and then take a discrete action from that region. This 
intermediate region together with the action and the originating region can be encoded as 
a player-2 state in the finite game as (R, R' , a±) with R denoting the originating region, R' 
representing the intermediate region to let time pass to, and a\ the player-1 action to take 
from R' . Note that this player-2 state (R, R' ,a±) corresponds to an intermediate step in the 
game 7^ bl , where from a state s G R player 1 has just proposed its move (say (Ai,a±) with 
the time delay A leading to the region R' from which the discrete action a± will be taken. 
The game of 7^ bl is still in the state s, waiting for the move from player 2. In the finite- 
state game, from the player-2 state (R, R' , ai), player 2 then takes an action depending on 
whether it would allow the previously proposed player-1 move from the originating region 
R in 7^ bl or not. This corresponds to the move of player 2 in 7^ bl where it allows or dis- 
allows the player-1 move (Ai,oi) from s (hence the element of "surprise" from |dAFH+03] 
remains in this finite-state game). From Theorem 13.91 we can consider parity games where 
each player is restricted to use only 3-region strategies. This allows us to restrict the pairs 
of regions that can occur as player-2 states in the finite game. 

Outline of subsection 13.21 We first present a finite-state turn-based game 7^ which can 
be used to compute the winning sets of T. The size of the state space will be 0(\S R ™ | ■ \Ai\) 
where <S R 3 " is the set of regions in 7^ bl and A\ is the set of actions of player 1 in 7. We then 
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f * / f * 

present a finite-state turn-based game Tq such that (1) the winning set of corresponds 

;?o td 

to that for 7^, and (2) the size of the state space of is 0(\S R f' 



3.2.1. Construction of the finite turn-based bipartite game T^. Given a timed automaton 7, 
and a location parity index function Q on states of T, the bipartite finite turn-based game 
Tq consists of a tuple (S? ,E? , S(, S[) where, 

• = S{ U s£ is the state space. The states in S( are controlled by player-i for i G {1, 2}. 

• S( = SReg x where S Reg is the set of regions in 7^ bl . 

• S f 2 = S Tup x {2}, where S Tup = S Reg x {0, 1, 2} x Af- . 

• E 1 ^ contains the following edges: 

— For s{ = (R,l) G S(, the set E$ contains the outgoing edges ((i?, 1), (R, j, a\, 2)) 

such that in the structure 7^ bl , there exists s G i! and (A,ai) G I^ 3 " (?) with 
|{Reg(s'+ A') | A' < A} | = j + 1. These types of edges encode in Jf the fact that in 
the game structure 7^ bl there exists a state in i? such that player 1 has a move such 
that time passes to the j-th following region from where the discrete action a± is taken. 

— For s{ = (R,j, oi, 2} in , the set contains the outgoing edges ((R,j, a\,2), (R', 1)) 
such that in the structure 7^ bl , there exists s G R, (Ai,oi) € ri(s) and {A2,a 2 ) € 
r 2 (s) with |{Reg(s + A') [ A' < Ai}| = j + 1 such that either 

(1) a t + l. m , |{Reg(s + A') | A' < A 2 }| < j + 1 and 5{s,(A 2 ,a 2 }) G R'. 

This edge corresponds to the case when player 2 does not allow a player- 1 move 
(Ai, at) from a state s € i? by proposing its own move (A2, a 2 ) such that A2 < Ai. 

(2) ai / ±*, 3 > |{Reg(?+ A') | A' < A 2 }| > j + 1 and <5(s, (Ai, 01)) G 

This edge corresponds to the case when player 2 allows a player-1 move (Ai,ai) 
from a state s G R by proposing a move of duration A2 > Ai. 

(3) 01 = .1* and 6{s, (A 2 , a 2 }) G 

This edge corresponds to the case when player 1 plays a _L* move so the move of 
player 2 is chosen. 

Note that we can pick any states in R and R' to check the satisfiability of the above 
conditions by Lemma 13. 11 

In the construction for above, the states in Sj up x {2} contain player-1 actions as 
components. The actions are used only for determining the destination states, so instead 
of player-1 actions as components of states in Sj up we can as well use a tuple from L x 2 C , 
with (I, a) denoting an action in T such that the destination location is I, and which resets 
the clocks in a. 

Each s-> G S* is a tuple, with the first component being a region of 7. Given the 
location parity index function on 7^ , we let $V be the parity index function on 
such that nf((R,-)) = ttf bl {s) where s is any state in the region R (all states in a region 
have the same parity as is a location parity index function on 7^). Note that if 
the parity index function is of order d, then fij^ and & are of order d + 2. Given a 

set X = Xx x {1} U X 2 x {2} C 5', we let RegStates(X) = {? G S 5 ?" | Reg(s) G Xi}. 
Theorem 13.101 shows that the turn-based game captures the game 7^ b f . 
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Theorem 3.10. Let T be a timed automaton game, £1 a location parity index function on 
states of T, with T^bi being the corresponding enlarged game, and 7? the corresponding 
finite game structure with the parity index function Q-f . Then, we have 

Winf^" (TimeDiv3Bli(Parity(ft))) = RegStates (Winf (Parity^))). 

Proof. Recall the ^-calculus formulation for obtaining the winning set 

Win 3. T 3M (TimeDiv3Bli(Parity(ft))) 

in T^bl ■ We use the Prei operator in turn-based games: 

{s G S Reg x {1} | 3s' G X such that (s, s') 6 E?} 
Prei(X) = " U 

{s G S Tup x {2} | V(s, s') G E* we have s' G X} 

Given X = X 1 x {1} U X 2 x {2} C £/, we have 

RegStates (Pref (Pref (X))) = RegStates (Vref (Pref (Xt x {1}))) (3.1) 

This is because the structure of the games Tq is such that a state with 1 as the last 
component of the tuple can reach X = X\ x {1} U X 2 x {2} in exactly two steps iff it can 
reach X\ x {1} in exactly two steps (the game is bipartite). 
From Lemma 13.11 it follows that 

RegStates (Pref (Vref (X l x {1}))) = CPreJ^' (RegStates (Xi x {1})) (3.2) 

Let (j) c be the //-calculus formula using the CPrei operator describing the winning set for 
Parity^g^) = TimeDiv3Bli(Parity(S7)) . Let <pt be the /x-calculus formula using the Prei 
operator in a turn-based game describing the winning set for Parity^) . As & and ttf bl 
contain the same number of priorities, the formula 4>t can be obtained from <j) c by syntacti- 
cally replacing every CPrei by Prei (see |d AHMOl] ) . Let the winning set for Parity(f2^ ) in 
be W\ x {1} U W 2 x {2}. It is described by 4>f The game in proceeds in a bipartite fashion 
— player 1 and player 2 alternate moves, with the state resulting from the move of player 1 
having the same parity index as the originating state. Note that the objective Parity(0^ ) 
depends only on the infinitely often occurring indices in the trace. Thus, W\ x {1} can be 
also be described by the ^-calculus formula (p' t obtained by replacing each Prei in & with 
Prei Prei, and taking states of the form s x {1} in the result. Consider the fixpoint itera- 
tion X^i for computing the set (j)' t . Since we are only interested in the set W\ x {1}, and since 
we have a bipartite game, the set W\ x {1} can also be described by an iteration X* in which 
each set in the iteration X^ is intersected with «SR eg x {1}. This is because each step in the 
iteration applies Prei Prei, and if X\ x {1}UX 2 x {2} = Prei Prei(Zi x {1}UZ 2 x {2}), 
then X\ x {1} = Prei P re i(Zi x {1}). Now, this new iteration I* describes the winning sets 
of the /i-calculus formula <f>" obtained from <p' t by intersecting every variable with SR eg x {1}. 
Using the identity 13. 2\ we have that the sets in the fixpoint iteration computation of <f>" 
correspond to the sets in the fixpoint iteration computation of (f) c , that is, if X x {1} occurs 
in the computation of <fi" at stage j, then RegStates(X) occurs in the computation of 4>c a t 
the same stage j. This implies that the sets are the same on termination for both 0" and 

<t> c . Thus, WinJ 3W (TimeDiv3Bli(Parity(0))) = RegStates(Win[ (Parity(^))). □ 
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From Theorem 13.101 we can solve the finite-state game to compute winning sets 
for all w-regular region parity objectives Parity for a timed automaton game T, using 
any algorithm for finite state turn-based games, e.g., strategy improvement, small-progress 
algorithms [VJOOl lJurOO] . 

3.2.2. Construction of the finite turn-based game • We now present the turn-based game 

7 J n which can be used to compute the sets RegStates(Win 1 n (Parity($V ))) of TA, with the 
state space of being linear in the number of regions of 7^ bl . The idea behind this 
construction is that when player 1 proposes a move (A, ai) in from state s, player 2 
can ignore the discrete component a\ of the move. Intuitively this is because player 2 should 
only care about the moves it can make before time A from state s. If there is a move it 
would like to make before time A, then player 2 should preempt the move of player 1. If 
there is no such move, then it should allow the move of player 1, irrespective of a%. The 
exact value of a\ (other than whether it is _L* or not) is irrelevant in determining the moves 
for which player 2 "wins" in a round. Since the value of a\ is irrelevant, there is no need 
to explicitly remember the discrete component of the moves of player 1 as was done in the 
game structure 7L. This allows us to bring the state space down, so that it is linear in 
the number of regions of T|^ d . In the game Tq, we broke a step of the game in T^bi ^ n ^° 
two steps. In the game , we break a step of 7^ bl into three steps: first player 1 makes 
a move, then, player 2, then player 1 and then the sequence repeats, i.e., the sequence of 
players making moves is (121)^. Note that this game is not bipartite as player 1 is making 
two consecutive moves. Informally, the three steps are as follows. 

(1) First, player 1 proposes either a 

• Relinquishing move, or 

• A move which corresponds to time elapsing to a desired region. The discrete part of 
the move is left for later. 

(2) Then, player 2 moves according to one of the following: 

• If player 1 had relinquished in the previous step, then it transitions to a player 1 
state such that the region component is the new region. In addition, this is a special 
player 1 state (we shall see what this means in a while). 

• If player 1 had proposed a non-relinquishing move to a region, then player 2 either 

— Blocks the player 1 move by proposing a move that corresponds to a shorter dura- 
tion move in T^bi • With this move, it transitions to a player 1 state such that the 
region component is the new region. In addition, this is again a special player 1 
state. 

— Decides to allow the player 1 move. But remember that we only know the time 
elapse part of the player 1 move till now. Thus, this move of player 2 transitions 
the game into an intermediate state from which player 1 can only propose the 
remaining discrete part of the move. 

(3) In the third stage of the game, if the state is not special, then player 1 proposes the 
remaining discrete part of its original move. If the state is special, then it only has one 
dummy move, which makes the special state non-special without changing the region 
component. The special states thus allow us to maintain a three stage game. 

Formally, given a timed automaton T, and a location parity index function on states of 
T, the finite turn-based game consists of a tuple (S?*, E?* , S( ,S( } where, 
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S 1 = S{ U ^ is the state space. The states in S( are controlled by player-i for 
i€{l,2}. 

S(* = S Reg x {l a ,l^ m } U S Reg x {0,1,2} x {l c }, where S Reg is the set of regions in 
7^ bl . Informally, the sequence of the players making moves is (121) w ; the moves from 
states labelled with l a as the last component will correspond to the first "1" and from 
states labelled with l dum or l c as the last component will correspond to the second "1" 
in (121) w . 

Sf = S Reg x {2} U S Reg x {0, 1, 2} x {2}. 
contains the following edges: 

— For s{ = (R, 1 ), the set E?* contains the following outgoing edges 

(1) ((R, l a ), (R,2)). These edges correspond to player 1 playing a relinquishing move 
from states in R in the game structure T|^ d . 

(2) ({R, 1 ), (R,j, 2)) such that in the structure 7f bl , there exists s£R and (A, a\) G 

F 1 3W (s) with |{Reg(s + A') | A' < A}| = j + 1. These types of edges encode in 
7?* the fact that in the game structure 7^ bl there exists a state in R such that 
player 1 has a move such that time passes to the j-th following region from where 
the discrete action a\ is taken. 

— For s{ = (R,2), the set E** contains the outgoing edges ({R, 2), {Rf , l d c um )) such 

that in the structure 7^ bl , there exists a state s G R and (A, 02) G r 2 3W (s) with 
<5(s, (A, 02)) G i?' and Succ^s, A) = true. These edges correspond to the fact in the 
game structure 7^ bl , player 1 had played a relinquishing move from some state s in R, 
and player 2 had played a move according to its 3-region strategy. 

— For s{ = (R,j,2), the set E?* contains the following outgoing edges: 

(1) ((R,j, 2), (R 1 , l dum )) such that in the structure T^ b i , there exists a state s G R and 

.;<>"' 

(A, 02) G r^" (?) with <5(s, (A, 02)) G and |{Reg(s + A') | A' < A}| < j + 1. 
These types of edges encode in * the fact that in the game structure there 
exists a state in R such that from s player 1 proposes a move such that time would 
pass to the j-th following region from where some discrete action would be taken, 
and player 2 is countering this player 1 move with a move of shorter (or equal) 
delay. 

(2) ({R,j, 2), (R,j, l dum )). This edge encode in 7$* the fact that in the game structure 
72 b \ there exists a state in R such that from s player 1 proposes a move such that 
time would pass to the j-th following region from where some discrete action would 
be taken, and player 2 is "allowing" this move of player 1. 

— For s{ = (R,j,l c ), the set E$* contains the outgoing edges ((R,j,l c ),(R',l)) such 

that in the structure 7^ bl , there exists a state s G R and (A,ai) G r i 3W (s) with 
S(s, (A, ai)) G ^R' and |{Reg(s + A') | A' < A}| = j + 1. This edge corresponds to the 
fact that in the game 7^ bl player 2 has allowed player 1 to take a move from a state s 
in R such that time passes to the j'-th following region from where the discrete action 
ai is taken. 

— For s{ = (R, lc um ), the set E f * contains the single outgoing edge ((R, l d c um ), (R, l a ))- 
This is just a dummy edge, to preserve the (121) w move pattern. 
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Given a set X = X u x {l a } U X Xc x {l c , l^ um } U X 2 x {2} C S f *, we let RegStates(X) = 
{s G | Reg(?) S ^i Q }- Given the location parity index function on 7^ , we let 

iV* be the parity index function on 7? such that $V ((R, •)) = ilg^(s) where s' is any state 
in the region R (all states in a region have the same parity as is a location parity index 
function on 7^ b \ )• 

The following lemma shows that two steps in the bipartite game 7^ correspond to three 
steps in the game . The proof is presented in the appendix. 

Lemma 3.11. Let 7 be a timed automaton game, £1 a location parity index function on 
states of 7, with 7^ bl being the corresponding enlarged game, and 7? , 7?* being the cor- 
responding finite game structures with the parity index functions $V and $V* respectively. 
Suppose X C S f and X* C S f are such that RegStates(X) = RegStates(X*). Then, 

RegStates (Pref (Pref (X))) = RegStates (Pref (Pref* (Pref * (X*)))) . □ 

Using the fact that two steps in the bipartite game correspond to three steps in the 
game , we have the following theorem which shows that we can use the game structure 
7^ to compute the winning sets for T. 

Theorem 3.12. Let 7 be a timed automaton game, Q a location parity index function 
on states of 7, with 7^ bl being the corresponding enlarged game, and 7? , 7^* being the 
corresponding finite game structures with the parity index functions & and Qf* respec- 
tively. Then, we have RegStates(WinJ / (Parity^))) = RegStates (Win^* (Parity^*))) = 

Win^ 3 "" (TimeDiv3Bli(Parity(ft))). 

Proof. Let the winning set in 7^ be W± x {1} U W 2 x {2}; and the winning set in 7q be 
W la * x {1J U W lc * x {l c , l^ um } U W 2 * x {2}. As proved in the proof of TheoremEZEUl the 
set W\ x {1} is also described by the //-calculus formula 0" obtained from the //-calculus 
formula 4> c , which describes the winning set for Parity^^), by (1) syntactically replacing 
each occurrence of CPrei in cj) c by Prei o Prei, and, (2) intersecting each variable in <j) c with 
5*Reg (where S^ eg is the set of regions of 7^ bl ). Using a similar argument, the set W\ a * x {l a } 
can be described by a //-calculus formula §* t obtained from <p c by (1) syntactically replacing 
each occurrence of CPrei in <f> c by Prei ° Prei o Prei, and, (2) intersecting each variable in <f> c 
with SR e g. By Lemma f3.Hl we have that the sets in the fixpoint iteration computation of <f>" 
correspond to the sets in the fixpoint iteration computation of that is, if X x {1} occurs in 
the computation of (ft" at stage j, then X x{l a } occurs in the computation of (ftl at the same 
stage j. This implies that the sets are the same on termination for both eft" and (ft*. Thus, 
Wi = W la *, and hence RegStates (Winf (Parity(O-f))) = RegStates(Winf '(Parity^*))) = 

Win*' 3W (TimeDiv3Bli(Parity(0))). □ 

Eliminating redundant states in T f *. The states with l^ um as the last component are 
redundant and were introduced just to have a three stage game to prove Theorem 13 . 1 2 1 using 
Lemma 13. Ill These redundant states can be removed as follows. The objective Parity($V ) 
depends only on the infinitely often occurring indices in traces. Consider the states in T-^* 
of the form (R, l^. um ). Player 1 has only one move from such a state, to (R, l a ). The parity 
of (R, lc Um ) is the same as the parity of (R, l a ). Thus, these kinds of states can be removed, 
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and the incoming edges made to point to the destination of these states without changing 
the part of the winning set with l a as the last component. That is, given a state (R, lc Um ) 
with m incoming edges (s-f*, (R, l^ urn )) for 1 < j < m, we can remove the state (R, l^ 1 "" 11 ) 

and add m edges (s-f*, (R, l a )) for 1 < j < m without changing the part of the winning set 
with l a as the last component. 



/J 3bl 

Reg 



7 Q 

where S R ™ 



3.2.3. Complexity of reduction. Recall that for a timed automaton game T, A{ is the set of 
actions for player i and C is the set of clocks. Let \A\\* = 1 + min \\A\\ + 1, \L\ ■ 2^1 } (the 
+1 outside is due to the relinquishing move of player 1, the min is because we can either 
use player-1 actions, including pure time moves; or use tuples of locations with clock reset 
sets as actions), and let | ^4.2 1 * = min{|A 2 | + 1, \L\ ■ 2l c 'l}. Let [TconstJ denote the length of 
the clock constraints in 7 and d be the order of the parity index function f2. The size of the 

state space of T-^* (with the l^ um states eliminated) is bounded by 8 • |5| 

is the set of regions of 7^f . The number of edges originating from player-1 states is at 

ffn td 

most |<SR eg I • (4 + 3 [ j4_i I * ) - The number of edges originating from player-2 states is at most 

\ S Re g I • 6 • 1^2 1*- The total number of edges is thus at most |S^ eg | • (4 + 3|^4i |* + 6|A 2 |*). 

We also have \S^ I to be bounded by 32 • d ■ \L\ ■ \[ xeC {c x + 1) • \C + 1|! • 4l°L 

Theorem 3.13. Let T be a timed automaton game, and let Q be a location parity index 
function of order d. The set WinTimeDiv^(Parity(ri)) can be computed in time 
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which equals 



O I l^egl 



d ■ \C\ ■ |Tconstr| + 



^Regl 



d -256- |C| 
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^Reg « 

f?n td 

^Reg 



'Regl 



where 

the set of regions of 7, with \S Reg \ = \L\ ■ Y\ x ec( c x + ■"■) ' 1^1' ' ^ > 

is the set of regions ofTf^, with |Sjgj [ = 32 • (|C| + 1) • d ■ |S£ e ' 
I^Constrl "is the length of the clock constraints in 7, 

• 1^1* = 1 + min + 1, |L| • 2 |c| } and \A 2 \* = min{|A 2 | + 1, \L\ ■ 2 |c| } with A* being 
the set of discrete actions of player i for i € {1, 2}, 

• L is the set of locations, and C is the set of clocks in 7. 

Proof. To solve for WinTimeDiv^(Parity(17)), we solve the turn-based game A? by Theo- 
rems [3]9] and [3J21 and Lemma 13.31 For constructing 7? , we need to check which regions 
satisfy clock constraints from T. For this, we build a list of regions with valid invariants 

together with edge constraints satisfied at the region. This takes 0(\S R ™ \ ■ |Tc nstr|) time 
(we assume a region can be represented in constant space in our analysis). From [Sch07] . 
we have that a turn-based parity game with m edges, n states and d parity indices can be 
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solved in 0(m ■ ns + 2) time. The turn-based game can hence be solved in time 

d+2 i 1 ■ 



o 



which is equal to 



I Q J :m 



(4 + 3|Ai|* +6\A 2 \*) 



o (i^r+i^n 



8-1^ 



ffn td 

I C Ml 



d+2 I 3 
3 ~ l ~2 



□ 



Remark 3.14 (Effects of receptive formulation). In the complexity, we multiply (S^gl by 
d as we go from Parity($7) to Parity(fi td ) = TimeDivBli(Parity(fi)) (this also increases the 
value of the exponent from | + | to ^rp + |). Multiplication of \S^ eg \ by 32 • \C\ comes in 
due to the extra clock z which measures global time, and introduction of the variables tick 
and 36?i . 

Usefulness of the finite turn-based game T f *. The reduction allows us to solve the 
finite-state game 7* to compute winning sets for all w-regular region parity objectives 
Parity (SI) for a timed automaton game T, using any algorithm for finite state turn-based 
games, e.g., strategy improvement, small-progress algorithms [VJOOl Uur00| . It also allows 
us to leverage any future improvements in algorithms for finite-state games for solving timed 
parity games. The reduction to a finite-state game that is linear in the number of regions 
of 7^ bl also shows that we do not pay any complexity penalty due to the concurrent nature 
of the timed game where both players simultaneously propose moves. 



4. Robust Winning of Timed Parity Games 

In this section we study restrictions on player- 1 strategies to model robust winning, and 
show how the winning sets can be obtained by reductions to general timed automaton 
games. The results of Section [3] can then be used to obtain algorithms for computing the 
robust winning sets. 

There is inherent uncertainty in real-time systems. In a physical system, an action 
may be prescribed by a controller, but the controller can never prescribe a single timepoint 
where that action will be taken with probability 1. There is usually some jitter when the 
specified action is taken, the jitter being non-deterministic. The model of general timed 
automaton games, where player 1 can specify exact moves of the form (A, a±) consisting of 
an action together with a precise delay A, assumes that the jitter is 0. In subsection 14. 1\ we 
obtain robust winning sets for player 1 in the presence of non-zero jitter (which are assumed 
to be arbitrarily small) for each of its proposed moves. In subsection 14.21 we assume the 
jitter to be some fixed ej > for every move that is known. The strategies of player 2 are 
left unrestricted. In the case of lower-bounded jitter, we also introduce a response time for 
player-1 strategies. The response time is the minimum delay between a discrete action, and 
a discrete action of the controller. We note that the set of moves with a jitter of £j > 
around (A,a\) contains the set of moves with a jitter of £j/2 and a response time of ej/2 
around (A,ai) (as {(A',ai) | A + ej/2 < A' < A + ej} C {(A',ai) | A < A' < A + ej}. 
Thus, the strategies of subsection 14.11 may be considered to have a response time greater 
than 0. The winning sets in both sections are hence robust towards the presence of jitter 
and response times. 
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4.1. Winning in the Presence of Jitter. In this subsection, we model games where the 
jitter is assumed to be greater than 0, but arbitrarily small in each round of the game. 

Given a state s, a limit-robust move for player 1 is either the move (A,_U); or it is a 
tuple ([a, /8],oi) for some < a < j3 such that for every A G [a, f3] we have (A,ai) G Ti(s) 
(|/3 — a| does not have a lower bound, it just needs to be greater than — hence the 
term limit-robust). Note that a move (A, J.*} for player 1 implies that it is relinquishing 
the current round to player 2, as the move of player 2 will always be chosen, and hence we 
allow a singleton time move. Given a limit-robust move mrob\ for player 1, and a move mi 
for player 2, the set of possible outcomes is the set 

e'the ^ mro ^ 1 = -'-*) an< ^ 7711 = mroo \\ or 

(b) mrobi = ([a, 0\,a\) and m\ = (A,ai) with A G [a, f3] 

A limit-robust strategy ir r ° b for player 1 prescribes limit-robust moves to finite run 
prefixes. A limit-robust strategy 7r r ° b is receptive if for all player-2 strategies TT2, all states 
s G S, and all runs r G Outcomes(s, vr^ ob , 7^), either r 6 Timediv or r G Blamelessj. We let 
II™ b denote the set of limit-robust strategies for player- 1 and ~\j r ° b > R the set of limit-robust 
receptive strategies. Given an objective <3?, let RobWinTimeDiv^^) denote the set of states 
s in T such that player 1 has a limit-robust receptive strategy 7r r ° b G H r ° ' such that for 
all receptive strategies 112 G II^, we have Outcomes(s, -ir[ ob , 7^) C We say a limit-robust 
strategy Tt r ° b is region equivalent to a (non-robust) strategy tt\ if for all runs r and for all 
k > 0, the following conditions hold: 

(i) if 7ri(r[0..fc]) = (A, _U), then 7c[ ob (r[0..k}) = (A', with Reg(r[A:] + A) = Reg(r[A:] + 
A'); and 

(ii) if 7ri(r[0..fc]) = (A, 01) with a x / ±*, then vr[ ob (r[0..A;]) = ([a,/3],ai) with Reg(r[k] + 
A) = Reg(r[/c] + A') for all A' G [a,0\. 

Note that for any limit-robust move ([a,/3],ai) with a\ 7^ _L* from a state s, we must have 
that the set {s + A | A G [a, /?]} contains an open region of 7. 
We first present an extension to Lemma 12.71 

Lemma 4.1. Let 7 be a timed automaton game and 7 be the corresponding enlarged game 
structure. Let be an u -regular region objective of 7. If tti is a region strategy that is 
winning for $ from Win^($) and ir r ° b is a robust strategy that is region- equivalent to ir\, 
then ir r ° b is a winning strategy for i from Win^(<&). 

Proof. Consider any strategy tt2 for player 2, and a state ~s G Win^($). We observe that 
the set Outcomes(s, 7rJ ob , 7T2) consists of runs r such that for all k > 0, either 

(1) ir[° b (r[0..k]) = (A,±i) and r[k + 1] G ? jd (r[A;], (A, 7r 2 (r[0..fc])); or 

(2) TT[ ob (r[0..k}) = ([a,P],ai) for some /3 > a > 

with r[k + 1] G S- jd (r[k], (A, a x ), 7r 2 (r[0..fc])) for some A G [a,0\. 
It can be observed that 

Outcomes(s, 7rJ ob , 7T2) = (^J Outcomes(s, n[, ^2) 



2 We can alternatively have an open, or half-open time interval, the results do not change. 
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7Ti(r[0..fc]) 



where tt[ ranges over all those (non-robust) player-1 strategies such that for runs r G 
Outcomes(s, tt^, 712) and for all k > we have 

'(A,±i) if ir[° b (r[0..k}) = (A,±i) 
JA,ai) if vrf b (r[0..fc]) = ([Q,/3],ai) 

for some A G [a, /?]; and 71^ acts like 7Ti otherwise (note that the runs r and the strategies 
tt[ are defined inductively with respect to k, with r[0] = s). Each player-1 strategy 7Tj in 
the preceding union is region equivalent to n\ since ir r ° b is region equivalent to tt\ and hence 
each tt[ is a winning strategy for player 1 by Lemma 12.71 Thus, Outcomes(s, vrJ ob , 7^) = 
Outcomes(?, ir[, 7T2) is a subset of <£, and hence 7r™ b is a winning strategy for player 1. □ 

We now show how to compute the set RobWinTimeDiv^(<I>). Given a timed automa- 
ton game 7, we have the corresponding enlarged game structure T3 which encodes time- 
divergence (we use the modified blame variable 6/1,3 as in Lemma 13.30 . We add another 
boolean variable to T3 to obtain another game structure 7 ro b- The state space of T ro b is 
S 7 x {true, false}. The transition relation 5 7,oh is such that 

a ? «*((5;r&i),(A J a i >) = {6(s, (A, a*)), rb[) 
where r6 1 = true iff rb\ = true and one of the following hold: 

(i) ai G A%; or 

(ii) ai = 1*; or 

(iii) at G A^ 1 and s + A belongs to an open region of T. 

A region R of 7 is said to be open if for all states s G R we have all the clock values in 
s to be non-integral. Given a location parity index function Q on 7 of order d, we define 
another game structure T^rob (based on T ro b) with the parity index function encoding 

TimeDivBli >3 (Parity(fi)) A □(r& 1 = true) as follows. The state space of T^rob i s S 7 rob = 

5^rob x {0, . . . , d — 1}. The transition relation 5 7 mb is specified as follows (similar to 7 n ). 

For (A,dj) G F 7 (s), we have 6 rob ((s,3, tick, bl\, rb\,p), (A, a^)) = {s' ,tick' ,bl'i,rb\,p') 
where 

• s' = 5 T (s,(A,a i )). 
3' = (3 + A) mod 1. 

• tick' = true iff 3 + A > 1. 

• bl'i = true iff z = 1 (i.e., its a player-1 move). 
true if rb\ = TRUE, dj G j4i U {_!_]_} 

and all clock values in C U {z} are non-integral 
true if rb\ = true and Oj G A^ U 
FALSE otherwise 
max(p, fi(s')) if izcfc = false 
Cl(s') if £icfc = true 



rb\ 



P' 
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The parity index function fij£ b is defined as 

1 if rbi = FALSE 

^rab(( s >3> tick > bh,3, rbi,p)) 



if rb\ = true and tick = bli t s = false 

1 if rb\ = true and tick = false, 6Z 1)3 = true 
p + 2 if rbi = true and tick = true 



Lemma 4.2. Let 7 be a timed automaton game, Parity(O) an objective on 7, and T^mb 
the corresponding enlarged game structure with the parity index function f2*i . Then in the 
structure T^ob, we have Parity (Sl*^ b ) = TimeDivBli j3 (Parity(Sl)) A □(r&i = true). 

Proof. The proof follows along similar lines to the proof of Lemma 12.51 We also observe 
than once rbi becomes false, it stays false in T^rob and hence the parity also stays odd for 
all the following states. Thus, if the maximum of lnfOften(il[:Q b (r)) is even for a run r, then 
we must have □(r&i = true) in the run. □ 

Theorem 4.3. Given a state s in a timed automaton game 7 and an uj-regular location 
parity index function we have s G RobWi nTimeDivJ(Parity(f})) iff 

(s, rbi = true, •) GWin[ rob ( TimeDivBli i3 (Parity(ft)) A D(r&i =true)). 
Proof. 

=>• Suppose player-1 has a winning limit-robust receptive strategy tti for Parity (0), starting 
from a state s in T. We show 

(s, •, •, rbi = true, •) G Win^ rob ( TimeDivBli >3 (Parity(fi)) A a (rbi = true)). 

We may consider tti to be a strategy in T^ob. Since tti is a limit-robust strategy, player-1 
proposes limit-robust moves at each step of the game. Given a state s, and a limit-robust 
move ([a, /3],ai), there always exists a < a' < /3' < f3 such that for every A G [a',/3 1 ], 
we have s + A belonging to an open region of T. Thus, given the limit-robust strategy 
7Ti , we can obtain another limit-robust strategy tt[ in 7, such that for every run r and 
every k > 0, 

(a) if m(f[0..k]) = (A,_U), then <(?[0../c]) = ir 1 (?[0..k]); and 

(b) if 7ri(f[0..fe]) = ([a,/3],oi), then tt[(? [0..k]) = (([a', /3'], m) with [a',P'} C [a,/3], and 
{r[k] + A' | A' G [a 1 , /?']} being a subset of an open region of T^^b. 

Thus for any player-2 strategy tt2, and for any run r G Outcomes((s, true, •), 71^, 7^), 
we have that r satisfies □ (r&i = true). Since tti was a receptive winning strategy for 
Parity(r2), ir[ is also a receptive winning strategy for Parity(f2) as Outcomes(s, iri, 7^) Q 
Outcomes(s, 7r^, 712) for any player-2 strategy tt2- Thus, ir[ enables player-1 to satisfy 
TimeDivBli i3 (Parity(S7)) A n(r&i = true). 

<= Suppose s = (s, rbi = true,-) G Win^ rob (TimeDivBli i3 (Parity(fi)) A a (rbi = 
true)). We show that player-1 has a limit-robust receptive winning strategy from 
state s (and hence from s). Let iri be a winning region strategy for player-1 for the 
objective TimeDivBli 3 (Parity(0)) A □(r&i = true) in 1°^. Since the strategy ensures 
rbi = true for all states in all runs from s, we have that for every run r starting from 
state s, the strategy tti is such that 7ri(r[0..fc]) = (A k ,ai) where either a\ = _L*, or 
r [k] + A fe belongs to an open region R of T°*ob Since R is an open region, there always 
exists some a < f3 such that for every A G [a,/3], we have r[k] + A G R. Consider 
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the strategy 7r r ° b that prescribes a limit-robust move ([a, /?], af) for the history r[0../c] if 
7Q(r[0..fc]) = (A fc ,a^) with a\ ± _U, and 7t[ ob (r[0..k}) = 7ri(r[0..fc]) otherwise. The strat- 
egy 7rJ ob is region-equivalent to 7fi, and hence is also winning for player- 1 by Lemma [4.11 
Since it only prescribes limit-robust moves, it is a limit-robust strategy. It is also a 
receptive strategy as it is region equivalent to the receptive strategy n±. □ 

Theorem 4.4. Let 7 be a timed automaton game, and let 0, be a location parity index func- 
tion of order d. The limit-robust winning set RobWinTimeDivi(Parity(fi))) can be computed 
in time 



0\\StJ?\-\7^ r \ + (\A 1 \* + \A 2 




Proof. Using reductions similar to those in Section El the game on T^*ob can be solved in 
time 0((\Ai\* + \A 2 \*) • (8 • \S^ b |)^ + ^) where s|^ ob is the set of regions of with 
|Tpg° b | = 64 - ( | C | + 1 ) ■ d ■ I^Rgg |- We also need to build a list of regions with valid invariants 

together with edge constraints satisfied at the region. This takes 0(|S , R eg ob | • |Tconstr I) time. 

□ 

We say a timed automaton 7 is open if all the guards and invariants in 7 are from 
Constr*(C). Note that even though all the guards and invariants are open, a player might 
still propose moves to closed regions, e.g., consider an edge between two locations l\ and 1% 
with the guard < x < 2; a player might propose a move from (l\,x = 0.2} to (l%,x = 1). 
The following example shows that player 1 might not have a robust winning strategy in an 
open timed automaton. 

Example 4.5. There exists an open timed automaton game 7 such that for a reachability 
objective <£, player 1 has a receptive winning strategy for <I> from a state s, but does not 
have a limit-robust receptive strategy for <3> from s. 

Consider the open timed automaton game T of Figure El The invariants of all the 
locations are true everywhere. The objective of player 1 is to reach I 2 . The set of player-1 
actions is {a^j }} an d the set of player-2 actions is {a^, a\, a|}. The location Z 3 and I 4 are 
absorbing locations. Consider the location I 1 . If x ^ 1, then player 2 can propose a time 
duration move to either I 4 or I 5 , and hence prevent player 1 from reaching l 2 . If x = 1, then 
player 1 can propose a time duration move to I 2 , which a receptive strategy of player 2 
must eventually allow. Now consider the location 1°. If x > 1 then player 2 can propose a 
time duration move to Z 3 . If x < 1, player 1 wins provided that it moves to the location I 1 
exactly when x = 1. Thus, no limit-robust winning strategy exists from any location (other 
than trivially from I 2 ). □ 



4.2. Winning with Bounded Jitter and Response Time. The limit-robust winning 
strategies described in subsection 14. 1 1 did not have a lower bound on the jitter: player 1 could 
propose a move ([a, a + e],oi) for arbitrarily small a and e. In some cases, the controller 
may be required to work with a known jitter, and also a finite response time. Intuitively, 
the response time is the minimum delay between a discrete action (of either the controller 
or the environment) and a discrete action of the controller. The response time models the 
delay between a location change in a timed game, and when the controller is allowed to 
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Figure 6: An open timed automaton game T with no player-1 limit-robust winning strategy. 



take an action based on the location change. We incorporate the response time in timed 
automaton games by restricting player 1 strategies. The jitter is modeled by expanding 
the set of resulting states to include all the states which lie in a jitter interval around the 
proposed player-1 delay. 

Strategies compatible for ej-jitter e r -response bounded-robust winning. Given a 
finite response time e r , player 2 can always propose pure time moves of duration e r /2. Thus, 
if player 1 is restricted to only playing moves of duration longer than e r , player 2 can ensure 
that player-1 moves are never chosen (by repeatedly playing pure time moves, and action 
moves of duration less than e r ). To allow for such blocking player 2 pure time moves, we 
only restrict player-1 strategies to contain moves that are of duration greater than e r from 
the last time a non-pure time move was chosen. In case either player proposes moves such 
that only time advances, without any discrete action being taken, we adjust the remainder 
of the response time. 

Let £j > and e r > be given bounded jitter and response time (we assume both 
are rational). Formally, a strategy i\\ compatible for ej- jitter e r -response bounded-robust 
winning of player 1 proposes a move 7Ti(r[0../c]) = (A,a±) such that if 

r[0..k] = s , {m\,m Q 2 ),s u (m\,m l 2 ), ...,s k 

then at least one of the following conditions holds. 

• For all < j < k, we have 

- if blamej(sj, m{, m J 2 , Sj+i) = true, then mj = (Aj, _Lj) for i <G {1, 2}. 

„z,i . , ... I 1 if blamei (sj, m\ , ml, = true 

- A > e r - V'_ n A 3 ,., where g(j) = < v J ' 1: 2 ' J+1; 

^1-0 gW w | 2 otherwise. 

- {m k l +e\ee [0,£j]} C T^Sk). 

This corresponds to the case where in the entire run r[0..k], (a) no discrete actions have 
been taken, only simple time moves; (b) player 1 can take a non-pure time move only 
after e r time units from the start of the run; and (c) the moves m\ + e must be legal 
player-1 moves for all e € [0,£j]. 

• There exists a p with 1 < p < k such that all of the following hold. 
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— blamei(sp_i, m^ -1 , m^ -1 , s p ) = true, for i G {1,2}, and m^ 1 = (A? -1 , af _1 ) with 
a 1 ^ 1 / ±i (i.e., the state r\p] arises in the run r due to a non pure-time move). 

— For all p < j < k, if blame^Sj, m{, m J 2 , Sj+i) = true then we have m^ = (Aj, for 
i G {1, 2} (i.e., only simple time passage moves are taken after the p — 1-th stage). 

a ~_ a i i / . \ I 1 if blamei (s,-, mi, mn, = true 

- A > e r - A J ,.,, where 0(7) = < v 3 1 2>j+i; 

^j-p 90)' yyJ! | 2 otherwise. 

- {m k 1 +e\ eG [0,£j]} C ri(a fc ). 

This corresponds to the case where in the run r[0..fc], (a) the last non-pure move was 
taken at r\p — 1]; (b) only simple time passage moves are taken from r[p — 1] till r[k]; 
(c) player 1 has to wait e r time units after the discrete action at r[p — 1] to propose a 
new discrete action at r[k]; and (c) the moves m\ + e must be legal player-1 moves for all 
e€ [0,£j]. 

ej-jitter £ r -response bounded-robust winning. Given a move m\ = (A, a±) of player 1 
and a move 771,2 of player 2, the set of £j-jittered states is given by {<5jd(s,mi + £,777-2) I 
e G [0, £j]}. Given a player-1 strategy tti compatible for ej-jitter £ r -response bounded-robust 
winning of player 1, and a strategy 7T2 of player 2, the set of possible outcomes in the present 
semantics is denoted by OutcomeSj r (s, ir±, 7^). Each run r in Outcomes^ (s, 7Ti, 1:2) is such 
that for all k > 0, the state r[/c + 1] belongs to the set of £j-jittered states arising due to 
the moves iTi(r[0..k]) and ^(rfO-.A;]) of player 1 and player 2 respectively. We denote the 
ej-jitter £ r -response bounded-robust winning set for player 1 for an objective <3? given finite 

£j and £ r by JRWinTimeDivJ* ej * £r ($). 

The timed automaton 7 £ v £r for computing JRWinTimeDiv^' £j,£r ($). We now show 

that JRWinTimeDivi* J ' r ($) can be computed by obtaining a timed automaton T e i ,£r from 

T such that WinTimeDiv^ ($) = JRWinTimeDiv^ J ' r ( ( J > ). Given a clock constraint ip we 
make the clocks appearing in ip explicit by denoting the constraint as tp( x ) for x = 
[xi, . . . , x n \. Given a real number 5, we let <p(lt + 5) denote the clock constraint cp' 
where ip 1 is obtained from p by syntactically substituting Xj + S for every occurrence 
of xj in ip. Let / £ j : Constr(C) i-> Constr(C) be a function defined by / £ j (y>( x )) = 
ElimQuant (V<5 (0 < <5 < £j — > ^?(x +5))), where ElimQuant is a function that eliminates 
quantifiers (this function exists as we are working in the theory of reals with addition, 
which admits quantifier elimination). The formula f £ '(p) ensures that p holds at all the 
points in + A I A < £j}. 

We now describe the timed automaton T £ j' £r such that 

WinTimeDivf J ' £r ($) = JRWinTimeDiv[' £j ' £r ($). 

The automaton has an extra clock z in addition to the clocks of 7. 

Locations: Corresponding to each location I of T with outgoing player-1 edges e\, . . . , ef" , 
the automaton T e J ,£r has m + 1 locations: l,l e i,... ,l e ™- The invariant for I is the same 

as the invariant for I in T. The invariant for Lk is z < e\ for all k. 

1 

Actions: The automaton T e J ,£r has the following actions: 

• The set of actions for player 1 is {(1, e) | e is a player-1 edge in T}. 

• The set of actions for player 2 is A^VJ{(a2, e) | 02 G and e is a player-1 edge in T} U 
{(2, e) I e is a player-1 edge in T} (we assume the unions are disjoint). 
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Edges: The automaton T £ J' £r has the following edges (every edge includes z in the reset 
set): 

• For (/, ct2, <p, I', A) a player-2 edge of T, the automaton T £ j' £r contains the player-2 edge 
(l,a 2 ,<p,l', A U {z}). 

• For every player-1 edge ej = (I, a{,ip, I', A) of 7, the location / of T £ J' £r has the outgoing 
player-1 edge (I, (l, ej ), f £ > ( 7 T (0) A (z > e r ) A / £ i(^), l e ., A U {z} ). 

• For (I, a 2 , <f 2 , 1', A) a player-2 edge of 7 and ej a player-1 edge from I, the location l e . 
of the automaton T e J' £r has a player-2 edge (Z e ., (02, ej), ip 2 , I', A U {z}). 

• For every player-1 edge ej = (I, a\, (p±, I', A) of T, the location Z ej of T £ -i ,£r also has an 
additional outgoing player-2 edge {l ej , (2, ej), <pi, I', A U {z}). 

The automaton T £ J' £r as described above contains the rational constants e r and ej. We can 
change the timescale by multiplying every constant by the least common multiple of the 
denominators of e r and ej to get a timed automaton with only integer constants. 
The role of the different edges in T £ J' £r is described below. 

• A player-2 edge in 7 £ >' £r labelled with 0,2 such that a 2 6 A\ corresponds to the player-2 
edge labelled 02 in 7. 

• Player 1 moving from I to l ej with the edge labelled (1, ej) indicates the desire of player 1 
to pick the edge ej from location I in the game T. This is possible in 7 iff the following 
conditions hold. 

(a) More that e r time has passed since the last discrete action. 

(b) The edge ej is enabled for at least ej more time units. 

(c) The invariant of I is satisfied for at least ej more time units. 

These three requirements are captured by the new guard in T £ j' £r , namely [z > e r ) A 

r%)A/ £ j( 7 T (o)- 

• Consider a location l Cj in T £ J' £r . If the game is at l ej , then it corresponds to the situation 
in T where player 1 has picked the edge labelled ej from location I, and it is up to player 2 
to allow it or not. The presence of jitter in T causes uncertainty in when exactly the edge 
ej is taken. This is modeled in T £ j ,£r by having the location l ej be controlled entirely by 
player 2 for a duration of ej time units. Within ej time units, player 2 must either: 

— propose a move (02, ej) (corresponding to one of its own moves 02 in 7), or, 

— allow the action (2, ej) (corresponding to the original player-1 edge ej) to be taken. 

Given a parity function on 7, the parity function Q7 '' ' on T £ j' £r is given by '' ' (I) = 
' (l e ) = f2 T (Z). In computing the winning set for player 1, we need to modify blamei for 
technical reasons. Whenever an action of the form (1, ej) is taken, we blame player 2 (even 
though the action is controlled by player 1); and whenever an action of the form (2, ej) is 
taken, we blame player 1 (even though the action is controlled by player 2). Player 2 is 
blamed as usual for the actions (02, ej). This modification is needed because player 1 taking 
the edge ej in T is broken down into two stages in T £ J ,£r . If player 1 were to be blamed for 
the edge (l,ej), then the following could happen: 

(a) player 1 takes the edge (1, ej) in T £ j' £r corresponding to its intention to take the edge ej 
in 7, and 

(b) player 2 then proposes its own move (a 2 ,ej) from l ej , corresponding to it blocking the 
move ej by a 2 in 7. 

If the preceding scenario happens infinitely often, player 1 gets blamed infinitely often even 
though all it has done is signal its intentions infinitely often, but its actions have not been 
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Figure 7: The timed automaton game 7 £ '' £r obtained from T. 

chosen. Hence player 2 is blamed for the edge (1, ej). If player 2 allows the intended player 1 
edge by taking (2, ej), then we must blame player 1. We note that this modification is not 
required if e r > 0, as in this case player- 1 signalling its moves infinitely often via moves 
of the type {l,ej) can only happen if time progresses by e r infinitely often, which implies 
time divergence. The construction of T £ j' £r can be simplified if ej = (then we do not need 
locations of the form l e ). 

Example 4.6 (Construction of 7 £ '" £r ). An example of the construction is given in Figure[7J 
corresponding to the timed automaton of Figure [TJ For the automaton T, we have A\ = 
{a\,a1,af,af} and A2 = {a^, ct|, T ne invariants of the locations of T are all true. 
Since T has at most a single edge from any location V to l k , all edges can be denoted as e^- 
The set of player-1 edges is then {eoi, eo2, e2o, eio}- The location Z 3 has been replicated for 
ease of drawing in 7 £ ' ,£r . The location Z 3 is also an absorbing location — it only has self-loops 
(we omit these self loops in the figures for simplicity). Observe that f £ '(x < 1) = x < 1 — £; 
and f £ i(y > 1) = y > 1. □ 

Given a set of states S of 7 £ > ,£r , let JStates(5) denote the projection of states to T, de- 
fined formally by JStates(5) = {(I, k) I is a location of T and (1,k) € S such that k{x) = 
k(x) for all x £ C}, where C is the set of clocks of 7. The next theorem states that the 
timed automaton game 7 £ ' ,£r can be used to compute the winning states in 7 from which 
player 1 has an ej-jitter e r -response bounded-robust winning strategy. The proof of correct- 
ness of the construction follows from the arguments given in the description of 7 £ '" £r . 
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Theorem 4.7. Let 7 be a timed automaton game, e r > the response time of player 1, and 
£j > the jitter of player 1 actions such that both e r and £j are rational constants. Then, 
for any co-regular location objective Parity of 7, we have 

JStates (\z = 0] n WinTimeDivf J ' er (Parity^"'))) = JRWinTimeDiv[' £j,er (Parity(0 T )), 



where JRWinTimeDiv^ r ' £i ' £r (<I>) is the winning set in the jitter-response semantics, 7 £ '^ 6 ' is 
the timed automaton with the parity function 07 '' ' described above, and \z = 0] is the set 
of states o/T e j' £r with k(z) = 0. □ 

Theorem 4.8. Let 7 be a timed automaton game, e x > the response time of player 1, and 
£j > the jitter of player 1 actions such that both e r and £j are rational constants. Then, 
for any co-regular location objective Parity of 7, the winning set 

JRWinTimeDiv[' £j ' £r (Parity(ft T )) 

can be computed in time 

( d+2 , 3 - 
(\SlX ,Q \ • I^Constrl 2 )) + l^ll • |A 2 | • (8 • \S%:f' n \) 3 2 

where S^^" is the set of regions of7 e > ,£r with 

\S 7 Re f" n \ = \Sl g \ ■ 128 • (|C7| + 1) • (|C| + 2) • d ■ (1^1 + 1). 

• max ( num(£j), num(e r )) • ( lcm(denom(£ r ), denom(£j))) ' C '' +1 

in which 

• S Reg is the set of regions of 7; 

• lcm() is the least common multiple function, denomQ and num() are the denominator and 
numerator functions respectively; 

• |Tconstr| "Is the length of the clock constraints in 7. 

Proof. Let the timed automaton game 7 have \L\ locations and \Ai\ player- i edges for 
i G {1,2}. The automaton 7 e >> £ < has \L\ • (1 + \A X \) locations, (\A X \ + \A 2 \ + \A X \ ■ \A 2 \) 
player-2 edges, and \A\\ player-1 edges. Given rational constants £j and £ r , all the constants 
in the system need to be multiplied by the least common multiple of the denominators of 
£j and £ r . The timed parity game T e J ,£r can hence be solved in time 

d+2 i 3 > 



( d+2 I 3 \ 
l^i|-|^|-(8-^ n ) 3 +5 J 



For every player-1 edge e,- = (l,a{,ip,l',X) we need to obtain /ej-lcm({denom( er ),denom( ej )})^)_ 
This takes time 0(M 2 ) (see [BPR03] ). We observe that /q-lcm({denom( £r ),denom( ej )})^^ can . 
not have any constants other than those in <p, and £ r ,£ r (this can be seen by putting <p in 
a disjunctive normal form and applying a Fourier-Motzkin like quantifier elimination pro- 
cedure [Sch86]). Thus, building a list of regions of 7 £ >' £r with valid invariants together with 
edge constraints satisfied at the regions takes time 0(\S^ g £r,Q \ ■ (Tconstrl 2 ) D 
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Figure 8: General timeline of a run in the game of Fig. Q] (decreasing sequence of timegaps 
a>). 



Example 4.9 (Differences between various winning modes). Consider the timed automaton 
Tin Fig. [TJ Let the objective of player 1 be n(-i/ 3 ), i.e., to avoid / 3 . The relevant part of the 
automaton for this example is the cycle 1°, I . The only way to avoid Z 3 in a time divergent 
run is to cycle in between 1° and I 1 infinitely often. In additional player 1 may choose to 
also cycle in between 1° and I 2 , but that does not help (or harm) it. In our analysis, we 
omit such l°,l 2 cycles, noting that our receptive formulation correctly allows any number 
of l°,l 2 cycles. We present an intuitive explanation here, a detailed analysis can be found 
in the appendix. 

Let the game start from the location I". In a run r, let t\ and t\ be the times when the 
a\ transition and the a 2 transitions respectively are taken for the j-th time. The timeline 
is depicted in Figure The guards x < 1 on a\ and y > 1 on a 2 ensure that the distance 
between the j + 1-th a\ transition, and the j-th. a 2 transition keeps on strictly decreasing 
with increasing j. To see this, observe that t{ +1 — t{ < 1 because of the guard x < 1, and 
t\ — 4 _1 > 1 because of the guard y > 1 . Rearranging, we get t\ +1 — < t{ — t^ 1 . Consider 
the time gap sequence = t\ + — t J 2 , i.e., the sequence of time gaps between the j + 1-th 
a\ transition, and the j-th a 2 transition. For any e-jitter strategy of player 1 with e > 0, this 
sequence must decrease by more than e for each step, which clearly cannot happen infinitely 
often since a? must be positive for all j, as the j + 1-th a\ transition must always happen 
after the j-th. a 2 transition. Thus, player 1 has no e-jitter bounded-robust winning strategy 
from 1°. Player 1 does however have a limit-robust strategy, as a limit-robust strategy can 
be such that the time gap sequence a? decreases, but by a smaller and smaller amount at 
each step, ensuring that a? stays positive for all j. This shows that e-jitter bounded-robust 
winning strategies for e > are strictly less powerful than limit-robust strategies. 

To see that limit-robust strategies are strictly less powerful than general receptive strate- 
gies, observe that player 1 does not have a winning limit-robust strategy from (1°, x = y = 1) 
as it would have to take the first a\ transition immediately. It can be shown that there 
exists a winning player-1 receptive strategy from (l°,x = y = 1). □ 

The following theorem follows from the fact that bounded-robust winning strategies 
are no more powerful than limit-robust strategies, which are in turn no more powerful than 
general receptive strategies. The strictness of the inclusions can be observed in Example 14.91 



Theorem 4.10. Let 7 be a timed automaton and $ an objective. For all ej > and e r > 0, 
we have JRWinTimeDiv^ j ' er ($) C RobWinTimeDivi($) C WinTimeDivi($). All the subset 
inclusions are strict in general. □ 
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Sampling semantics. Instead of having a response time for actions of player 1, we can 
have a model where player 1 is only able to take actions in an ej interval around sampling 
times, with a given time period e S ampie- A timed automaton can be constructed along similar 
lines to that of T e J' £r to obtain the winning set. 
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5. Appendix 

5.1. Proofs of Section [2]. Proof of Lemma 12.51 

Proof. Consider a run r of T Qt . We show that the maximum index visited infinitely often 
is even iff the run f satisfies ((□<> tick — > Parity(Sl)) A (-■□<> tick — > OD-i bl\)) 

Suppose the run r satisfies ((□<> tick — ^ Parity(fi)) A (->□<> tick — ^ OD-i bl{j) We show 
the maximum index visited infinitely often is even. The following cases can arise. 

(a) The run r satisfies -iDO tick and <>□— ■ 6Zi, i.e., r[j] has tick = bl\ = false for all 
j > n for some n. In this case the parity seen infinitely often is (even). 

(b) The run r satisfies □<> tick and also belongs to Parity(Sl). The component p of the 
state in T^ td remembers the maximum index seen since the state following the 
last occurrence of tick = true. That is, in a run r , if r[j] has tick = true, and 
r[j + 1], . . . r[j + m] all have tick = false (except possibly for r[j + m]), then for 
j + l<k<j + m the value of p in r[k] is equal to max{il(r[z]) | j + 1 < i < k}. 
Since tick is true infinitely often, the maximum O td index seen infinitely often is 
m + 2 where m is the maximum fi index seen infinitely often in r. Since r belongs 
to Parity (fi), m is even. Thus the parity seen infinitely often is m + 2 (even). 

<= Suppose the run f does not satisfy ((□<> tick — > Parity(Sl)) A (->□<> tick — > OD-i bl±)). 
We show the maximum index visited infinitely often is odd. The following cases can 
arise. 

(a) The run r satisfies -iDO tick and □<> bl\. In this case tick = false for all j > n for 
some n; and bl\ = true infinitely often. Thus, the maximum index seen infinitely 
often is 1. 

(b) The run r satisfies □<> tick and also belongs to Parity ($7). As above, the maximum 
Qtd j nc j ex s een infinitely often in r is m + 2 where m is the maximum fi index seen 
infinitely often in f. Since r does not belong to Parity (Q), m is odd. Thus the parity 
seen infinitely often is m + 2 (odd). □ 

Proof of Lemma 12. 7L 

?rn td 

Proof. Consider the //-calculus formula ip for describing the winning setWini (Parity(ft td )). 
The formula contains the CPrei operator. The set CPrei(Z) remains unchanged (for Z a 
union of regions of T^ td ) if player 1 is restricted to use only memoryless strategies. Suppose 
s € CPrei(Z), and let m\ be the winning move of player 1 from s such that no matter 
what player 2 does, the next state lies in Z. Let R\ = Reg(<5(s, m\ )), and let the available 
moves of player 2 from Is be to regions R\, . . . , R^- We have that from any state in Reg(s), 
player 1 has a move to Ri, and that player 2 can only take moves to R\, ■ ■ ■ , R%- From 
Lemma 13. 11 it then follows that if player 1 proposes a move to R\ from any state in Reg(s), 
then no matter what player 2 does, the resulting state will lie in Z. Thus, memoryless 
region strategies suffice as winning strategies. Moreover, again from Lemma 13. 11 any move 
of player 1 to the region R\ is a winning move. Thus, we have that there is a memoryless 
region winning strategy 7Ti from winning states, and that any strategy region equivalent to 
7Ti is also a winning strategy. □ 
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Proof of Proposition 12.81 

Proof. Intuitively, in the structure T, we want player 1 to be able to infer the values corre- 
sponding to 3, tick, Mi, p. If player 1 can do this, then it can maintain the structure 7 n 
in memory, and thus it can use a winning memoryless strategy of T^ 2 (by Lemma 12.71 
memoryless strategies suffice in T^ td ). This strategy will then be move-independent. The 
values of 3 and tick can be inferred from the value of the global clock z. And given a 
state r[k] in a run r, and a move (Ai,ai) of player 1, the bl\ component will be true iff 
(time(r[fc + 1]) — time(r[&])) = Ai and <5(r[fc], (Ai,ai)) = r[k + 1]. The value of the compo- 
nent p can be inferred from the parity values of £1, and the values of tick, bl\. □ 

Proof of Lemma 12.91 

Proof. Consider the CPrei operator in the ^-calculus formula for describing the winning 
set Wir^ (Parity (f2 td )). The set CPrei(Z) remains unchanged (for Z a union of regions 
of ) if player 2 is restricted to playing only move-independent region strategies. This 
is because from it can be shown from Lemma 13.11 that from any state s, the ability of 
player 2 to prevent player 1 from reaching Z in the next step depends only on Reg(s), the 
regions in Z, and the current move of player 1. Moreover, if player 2 can prevent player 1 
from reaching Z from s for all player 1 moves, then there is a unique region R* such that 

for all s' E Reg(s), against any player 1 move m\ , player 2 has a counter-move m 2 ' 1 

with (js' ,rn 2 ' mi ) E R* such that the move m 2 ' mi prevents the player-1 move mf' 
from reaching Z. Thus, move-independent region strategies of player 2 suffice as spoiling 
strategies. □ 



5.2. Proofs of Section [3l We start with the statement of a classical result of [AD94] that 
the region equivalence relation induces a time abstract bisimulation on the regions. 

Lemma 5.1 ([AD94]). Let Y, Y' be regions in the timed game structure T. Suppose player i 
has a move from s\ E Y to s[ E Y' , for i E {1,2}. Then, for any S2 E Y, player i has a 
move from S2 to some s' 2 &Y'. □ 

Proof of Lemma 13.11 

Proof. From Lemma |5.1[ if player i has a move from some s\ E Y to E Y' , for i E {1, 2}. 
Then, for any S2 E Y, player i has a move from S2 to some s 2 E Y' . 

Consider the case when Y{ 7= Y 2 . The proof follows from the fact that each region has 
a unique first time- success or region. Thus, if Y[ is "closer" to Y than Y 2 , then the move of 
player 1 wins, otherwise, the move of player 2 wins. A region R' is a first time-successor of 
R 7^ R' if for all states s E R, there exists A > such that s + A E R' and for all A' < A, 
we have s + A' E R U R'. The time-successor of {I, h,V{C)) is (/, h',V'{C)) when 
• h = h', V(C) = (C_i,C + %,C x ,...,C n ), and V'(C) = {C^,C = 0, C[, . . . , C' n+1 ) 

where C[ = Cj_i, and h(x) < c x for every x £ Co. 
. h = h', V{C) = (C_!,C + 0,C 1 ,...,C n ), and V\C) = (C'_ x = C_i U C ,C = 

0, C\, . . . , C n ), and h(x) > c x for every x E Cq. 
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• h = ti, V(C) = (CU,C + 0,C 1 ,...,C n ), and V\C) = {C'_^C' Q = 0, C[, . . . , C' n+1 ) 

where C[ = Cj_i for i > 2, h{x) < c x for every x G C{ C Co, and /i(x) > c x for every 

x G C7 \ C( , and C'_ x = C7_i U C \ C[ . 
. V(C) = (C_i,C Q = 0,C 1 ,...,C7 n ), 7>'(C) = (C7_!,C7 = C„, C x , . . . , C n ^), and h'(x) = 

h(x) + 1 < c x for every x G C n , and h'(x) = h(x) otherwise. 
. V(C) = (C-i.Cb = 0,Ci,...,C n ), 7>'(C) = (C-i = C_iUC n ,Co,C7i,... ) C n „i), and 

h'(x) = h(x) = c x for every x G C n , and /i'(x) = /i(x) otherwise. 
. V{C) = (C- 1 ,C = ®,C 1 ,...,C n ),V'(C) = (C'_ 1 = C„ 1 UC n \C' ,C' ,C 1 ,...,C n -i), l md 

h'(x) = h(x) + 1 < c x for every x G C[ C C n , /i'(x) = /i(x) = c x for every x G C n \ C[, 

and /i'(x) = h(x) otherwise. 
In case Y[ = Y 2 ', then player 2 can pick the same time to elapse as player 1, and ensure that 
the conditions of the lemma hold. □ 

Completion of proof of Lemma 13.51 

Proof. (Continued). 

We constructed the 3-region strategy n\ from tti , and we claimed tt\ was a winning strategy. 
We were proving it by contradiction. We assumed a spoiling strategy 7r| for 7r*, and we 
constructed a player-2 strategy 7T2 that we claimed was spoiling for tti. We were showing 
by induction that there exists a run P3 G Outcomes(s, tti, ^2) such that r^[k\ = r^[k] for all 
k > (r*3 was the run used in defining 7T2, and is such that f| G Outcomes(s, 7r*, 7r|) and 
f| ^ TimeDiv3Bli(Parity(f2))). We present the details of the induction proof. The proof of 
the above claim is by induction on k. For k = the claim is trivially true. Suppose the 
claim is true for all j < k. Thus, we have a run F3 such that r%\j\ = for all j < k. 
We show that the run p3[0..fc] can be extended to r^O-.k + 1] according to 7ri,7T2 such that 
r^[k + 1] = rt\k + 1]. We have the following cases: 

(1) The following hold. 

• 7ri(r 3 [0..fc]) = (A,ai), and 

• a\ = _U. 

Since n\ is a memoryless strategy of , we have 7Ti(f|[0../c]) = (A, _!_*). Sup- 
pose ^(^[O--^]) nas the form (A2,a2). By definition of 7T2, we have ^(fsfO-.A;]) = 
7r2(r3[0..fe]) = 7T2(f^[0..fc]) = (A2,a2) (vr2 is a memoryless strategy). Hence, we have 

r* 3 [k + 1] = (9* 3 [k], (A 2 , 02)) = ^ (f 3 [fc], (A 2 , a 2 )) =r 3 [k + l}. 

(2) The following hold. 

• vr 1 (f 3 [0..fc]) = (A, 01), 

• a\ 7^ _L*, and 

• Succrg" (f 3 [/c], A) = TRUE 

Since iri is a memoryless strategy, we have 7Ti(fJ[0../c]) = (A,oi). Combining this 

fft} td 

with Succr 3 (r^[k],A) = true, yields 7r*(r^[0../c]) = 7ri(r^[0..fc]) = (A, ai). Let 
7r 2 (^3[0..A;]) = (A 2 ,02)- By definition of 7T2, we have ^(r^fO../;;]) = ^(f^fCA;]) = 
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7T2(f|[0..fc]) = (A 2 ,a 2 ) (tt 2 is a memoryless strategy). Thus, 

(? 3 [k],n 1 (? 3 [0..k]),7r 2 (? 3 [0..k]))=5^ (r 3 [k],{A, ai ),(A 2 ,a 2 )) 



?fn td 

= (r* 3 [k],(A, ai ),(A 2 ,a 2 )) 

Hence f 3 [0../c] can be extended to r 3 [0..k + 1] according to n±, n 2 such that f 3 [0..£; + 1] = 
r%[0..k + l\. 

(3) The following hold. 

• vri(f 3 [0..A;]) = (A,ai),ai / 1*, 

• Succrj" (f 3 [/c], A) = FALSE, 
. 7rJ(r 3 [0..fc]) = <A',±i), and 

• 7r5(f 3 [0..fc]) = (A*, a*) with A* < A'. 

We have vr 2 (f 3 [0..fc]) = 7r|(f 3 [0..fc]) = (A*,a* 2 ). Since A' < A (by definition of ttJ), we 
have A* < A, hence 

r 3 [fc + 1] € ^Vstfc], <A,ai), (A*,a 2 )) = {<^(f 3 [fc], (A*,a 2 ))}. 
Since tt^,^ are memoryless, we have 

7rt(fS[0..fc]) = 7r;(fs[0..fc]) = <A',± 1 > 

vr 2 *(f|[0..A ; ])=7r 2 *(f 3 [0..fc]) = (A*,a 2 ). 
Thus, Tg[fe + 1] belongs to 

67f (r* 3 [k],(A',± 1 },(A*,a 2 )) = {6 7 soi (r* 3 [k], (A*, a 2 ))} = {6 7 ™ (r 3 [k\, (A*,a 2 ))}. 

(4) The following hold. 

• vri(f 3 [0..fc]) = (A,ai),ai / _L*, 

• Succrj" (f 3 [fc],A) = FALSE, 

• 7rJ(f 3 [0..fc]) = (A', and 

• 7r|(f 3 [0..fc]) = <A*,a5) with A* > A'. 

We have vr 2 (f 3 [0..fc]) = (A',_L 2 ) by definition. Since A' < A(by definition of 7rJ), we 
have 

h[k + 1] G 4^ (f 3 [fc], (A, ai), (A, j_ 2 » = (f 3 [fc], (A, j_ 2 »}. 
Also, since 7r^ and 7r 2 are memoryless, we have 

= {6^(r* 3 [k],(A',U))} 

= {5 7 ™ (r 3 [k], (A',_Li))}. 

Thus, f 3 [fc + l] and f|[/c + l] are the same except for perhaps the 3bl\ component. Since 

Succr 2 (r 3 [k],A r ) = false (by definition of 7r*), we must have 3bh = false in both 
r 3 [k + 1] and f|[fc + 1]. Hence r 3 [k + 1] = r$[k + 1]. 

(5) The following hold. 
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• 7ri(f 3 [0..fc]) = (A,ai),oi + U, 

• SuccrJ (f 3 [k], A) = false, 
. ir* 1 (r 3 [0..k]) = (A',± 1 ), 

• 4(f 3 [0..fc]) = (A*, a* 2 ) with A* = A', 

• <5 t ?m (r|[A;],7r|(f|[0..A;])) =f|[it + l], and 

• (J* [ife] , ttJ (f£ [0.. fc] )) 7^ ^ [jfe + 1] . 

We have tt 2 (? 3 [0../c]) = ir%(r 3 [0..k]) = (A*,a* 2 ) by definition. Also irl(r£[0..k]) = 
7r*(f 3 [0../e]) = (A,J_i); and 7r*(f|[0..fe]) = 7r*(f 3 [0..A;]) = (A*, a*} since tt* and vr* 



f* J Qi 



are both memoryless. Now, f 3 [/c + 1] belongs to S^ bl (r' 3 [k],Tri(r 3 [0..k]),ir2(r 3 [0..k])), 
and 



(r3[fc],7ri(f 3 [0..fc]),7r 2 (f 3 [0..A;])) = 6 J ™ (r 3 [k], (A, ai), (A*, a* 2 )) 

= {8^(r 3 [k],(A*,a* 2 ))} 



= {S^(r 3 [k],(A*,a* 2 ))} 
= {r* 3 [k + l]}. 

Thus, we have r 3 [k + 1] = r^[k + 1]. 
(6) The following hold. 

• 7Ti(f 3 [0..fc]) = (A,ai),ai / _L*, 

• Succrg" (r 3 [k], A) = false, 

• 7r;(f 3 [0..A;]) = (A',±i>, 

• vr|(f 3 [0../c]) = (A*,a^) with A* = A'; and 

. 5^(^[k],7T* 1 (?* 3 [0..k]))=?* 3 [k + l]. 

We have 7r 2 (f 3 [0..fc]) = (A*,± 2 ) by definition. Also vr*(f^ [0..A;]) = 7rf(f 3 [0..fc]) = 
(A',±i); and ^(r^CLA;]) = 7r|(f 3 [0../c]) = (A*,^) since ir\ and ^ are both memo- 

?ffi td 

(""Oil 



ryless. Then, f 3 [fc + l] € <5 jd 3W (r 3 [fc], 7n(f 3 [0..A;]), 7r 2 (f 3 [0..fc])) and 

S- d 3bl (f 3 W,vr 1 (f 3 [0..A:]),7r 2 (f 3 [0..A:])) = ( 5 j > i (f 3 [fc], (A, ai), (A*, ± 2 » 

= {^ d (f 3 [fc],(A*,± 2 })} 
= {^ d (f* W , (A *,± 1 ))} 

since Succrg (r^[k], A) = SuccnJ (f 3 [A;], A) = false, and A' = A* and 

Succi-3 (fg[A;], A') = true. 
We have , 

^o td ??o td 

? * [k + 1] = S^ (?*[k],(A*,± 1 )) = 5 T ™ (r* 3 [k],(A',± 1 )) 
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and 



&»m[k), <a*,±!)) € sjfm}- <c";[o..A])- 



^ d (f^[A ; ],(A',± 1 ),(A* J 4}) 



= 8^(r 3 [k},{A',± 1 ),(A*,a* 2 )) 
3? 3 [k + l] 

Thus, the run f^fCA;] can be extended such that r 3 [£; + 1] = r%[k + 1]. 
Thus, in all cases, we have that f 3 [0..&:] can be extended to r 3 [0..k + 1] according to 
7ri,7T2 such that f 3 [0../c + 1] = f|[0..A; + 1]. Hence, we have r$ G Outcomes^, m, tt 2 ) and 
f~3 ^ TimeDiv3Bli(Parity(fi)) as f| ^ TimeDiv3Bli(Parity(il)), a contradiction since -k\ was 
assumed to be a winning strategy. Hence, we cannot have the existence of the strategy ir^ 
from which and tt 2 were derived, i.e., irl is a winning strategy for player 1 from s'. O 

Completion of proof of Lemma 13.71 

Proof. (Continued). 

We continue to show that if s G CPrei )3 (Z), then s£ CPrei(Z). Suppose s G CPrei i3 (Z). 
We can have the following cases. 

(1) {<5(s, (A,a 2 )) | (A,a 2 ) G T 2 (s) and Succr 3 (s, A) = true} C Z. 

Consider the cardinality of the set B = {Reg(<5(s, (A, _L 2 )) I (A, _L 2 ) G r 2 (s))}. 

• If < 3 then we have 

{5(s, (A,a 2 )) | (A,a 2 ) G r 2 (s) and Succr 3 (s, A) = true} 
={8(s,(A,a 2 )) | (A,a 2 )er 2 (s)}. 

Hence s£ CPrei(Z) in this case (when < 3). 

• Suppose \B\ > 3. Then, there exists A such that (A, a 2 ) G T 2 (?) (and hence (A, G 
Ti(?)) with |{Reg(s + A) | A < A}| = 3. 

Consider the player-1 move (A,J_i). Since (A,a 2 ) G T 2 (s), we must have (A,_L 2 ) G 
r 2 (s). By assumption, we have 

<f(s,<A,± 2 » ez 

(A, a 2 ) G T 2 (s) and 
Succr 3 (s, A) = TRUE 

Since A is such that Succr 2 (?, A) = false, we have 5(s, (A, _L 2 )) = 5(s, (A, J_i)) (the 
3bli component is false in both cases). Hence, 5(s, (A,_l_i)) G Z. Also, since 

* {5(s, (A',a 2 )) | (A',a 2 ) G T 2 (s) and Succr 3 (s, A') = true} C Z and 

★ |{Reg(s + A') | A' < A}| = 3 
we have 

{6(s, (A',a 2 » | A' < A, and (A',a 2 ) G T 2 (s)} C Z. 
Hence s G CPrei(Z) in this case by the second condition of CPrei(Z) (the winning 
move of player 1 being (A,_l_i)). 

(2) There exists (A,ai) G Ti(s) with a\ ^ _L* such that 

(a) 8{s, (A,ai)) G Z, and 

(b) {S(s, (A',a 2 )) | A' < A, Succr 3 (?, A') = true, and (A',a 2 ) G r 2 (?)} C Z. 
Consider the cardinality of the set D = {Reg(s + A') | A' < A}. 



(the assumption being < 5(s, (A,a 2 )) 



a) 
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• If \D\ < 3 then 

{S(s, (A',a 2 )) | A' < A, Succr 3 (s, A') = true, and (A',a 2 ) G r 2 (s)} 
={6(s, (A',a 2 )) | A' < A, and (A',a 2 ) G T 2 (s)}. 

Hence s G CPrei(Z) in this case (when \D\ < 3) by the second condition of CPrei(Z). 

• Suppose \D\ > 3. Since (A, ai) G Ii(s), we must have that there exists A' < A such 
that (A, J_i) G ri(s) and |{Reg(s + A") | A" < A'}\ = 3. 

Consider the player-1 move (A', J_i). As (A,ai) G ri(s), we must have (A',_L 2 ) G 
T 2 (s). Also, since 

{5{s, (A",02» | A" < A, Succr 3 (s, A") = true, and (A",a 2 ) G T 2 (s)} C Z, 

we must have 5(s, (A', -L 2 )) G Z. Since A' is such that Succr 2 (s", A) = false, we 
have 5(s, (A',_L 2 )) = 5(s, (A', J-i)) (the bl\ component is false in both cases). Hence 
S(s, (A', J_i)) G Z. Also, since 

★ {5(s, (A",a 2 )) \ A" < A, (A",a 2 ) G T 2 (s) and Succr 3 (s, A") = true} C Z and 

* |{Reg(?+A") | A"< A'}| =3, 
we have 

{6(s, (A",03» | A" < A', and (A",a 2 ) G T 2 (s)} C Z. 
Hence s G CPrei(Z) in this case by the second condition of CPrei(Z) (the winning 
move of player 1 being (A',_l_i}). 
Thus, in all cases s G CPrei(Z) whenever s G CPrei j3 (Z). □ 

Proof of Lemma 13. Ill 

Proof. Recall that by Equation 13.11 mentioned in Theorem 13. 101 given A = X\ x {1} U A 2 x 
{2} C S* , we have 

RegStates fore? fore? (A))) = RegStates fore? fore? (X l x {1}))) 

Similarly for the structure T f * , given X* = X{ a x {l a } U X^ x {l c } U A 2 * x {2} C S f * , 
we have 

RegStates (Pref * (Wf * (Wf * (A*)))) = RegStates (Pref * (pref * (Pref * (A^)))) 

1 (5.1) 

Thus, it suffices to show that if X 1 x {1} C S f and Aj* x {l a } C S f * are such that 
RegStates(Ai x {1}) = RegStates(A 1 * a x {1}), then 

RegStates (Pre? (Pref(Ai x {1})))= RegStates (Pref * fore? * fore? \x{ a x {la})))) • 

Note that RegStates(Ai x {1}) = RegStates(A 1 * a x {1}) implies Ai = X{ a C S Reg , where 
5*R e g is the set of regions of T^ d . 
(=>) For 

s / = (i?,l) G p re f (Pref (A lX {l})) 

we show 

sT = (R,l a ) G Pref* (Pre?' (Pref * (A* a x {l a }))) . 
Let player 1 choose to go to (R,j, ai, 2) from We can have have two cases: 
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— a± = _L*. 

If this is the move of player 1 for demonstrating that G Pre^ ^Pre^ (Xi x {1})^ , 

it means that every player-2 move from (R, j, a\ , 2} goes to X\ x {1}. For this to 
happen, we must have that in the game structure 7^ bl , for every state s G R, for 

every player-2 move (A, 02) such that Succr 3 3W (s, A) = true and (A, 02} G ^(s), 
we have S(s, (A,a 2 )) G -R' with (R f , 1) G Xi x {1}. 

Consider = (R, l a ) in S* . Let player 1 choose to go to = (R^)- Because 
of the restriction of player-2 moves from R proved above (that every player-2 move 
within two successor regions must lead to a state such that the region corresponds 
to a region in X\ and hence in X* ), we must have that in 7?* every player-2 edge 

from s f * 2 leads to a state (R',l d c um ) such that R' G X± = X{ a . Let X{ c = X x . 
We now have that every player 2 edge from leads to a state in X\ c x {l^ um }. 
The next state in the game will obviously be in X\ x {l a } = X* a x {l a }. Thus, 

s f * = (R, l a ) G Pref* (Vref * (Pre[ f * (X* a x {l a }))) if the first move of player 1 
corresponds to _L*. 

- a\ / L*. 

This case can have two subcases according to the available move for player 2. 

— This subcase corresponds to the case where player 2 does not "allow" the player-1 
move in a corresponding stage in 7f b[ . We must have fact 1 : that for every s G 

f" td _ 

R, for every player-2 move (A, 0,2) such that (A, 0,2) G r 2 3W (s) and |{RegJ+ A' | 

A' < A}| <j + l, we have that 5 T 3« (s, (A,o 2 » G i?' such that Reg(i2') G Xi. 

— This subcase corresponds to the case where player 2 "allows" the player-1 move. 
We must have fact 2 : that for every s G i?, there exists a player-1 move (A, 01) 

T ntd 

such that (A,ai) G F 1 3W (s) and |{Regs + A' | A' < A}| = j + 1, we have that 

<5 T 3w (s, (A,ai)) G R' such that Reg(i?') G X x . 
Let Xj* c = X\ = X* . Consider = (R,l a ) in S 1 ^ . Let player 1 choose to go 
to (R,j,2). we show player 1 can ensure going to X* a x {li a } with this choice. 
Player 2 can do two things from (R,j, 2). 

• Player 2 decides to "not allow" the player-1 move in a corresponding stage in 
T^ d . This corresponds to player 2 taking an edge to a state (R', l^"™} in 
from sf* which can happen if for every s£i?, there exists a player-2 move (A, 02} 

such that (A, a 2 ) G T 2 3bl (s) and |{Reg s + A' | A' < A}| < j + 1. By fact 1, we 

must have that for every such (A, 02), we must have J sm (A, 02)) G R' such 
that Reg(i?') G Xi. Hence, every "not allow" player-2 edge in from s?* leads 
a state in Xj* c x {lc Um }, the next state from which will lie in X* a x {l a }. 

• Player 2 decides to "allow" the player-1 move in a corresponding stage in 7^ . 

t * f * """" 

This corresponds to player 2 taking the edge from s-* in to the state (R, j, l c ) . 
From (J?, j, l c ), because of fact 2, player 1 can pick the edge corresponding to 
the action a\ which will lead it to a state in X\ x {l a } = X* a x {l a }. 
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Thus, if a± ^ _L*, playerl has a strategy to go from to X{ a x {l a } in three steps. 
sf* = (R,l a ) € Pref * (Pref* (Pref * (X* a x {l a }))). 
Hence in both cases, we have that 

sf" = (R,l a ) € Pre?'' (Pref* (Pref (X* la x {!.}))) . 

(<=) For 

a'* = (S,l > € Pre?'* (Pre?'* (Pref ' x {1 B }))) 

we show 

s / = (^,l} G P r ef (Prefix {1})). 
The following cases arise depending on the move of player 1 from s^* which witnesses 
a /*€Pref (Pref* (Pref * {l a }))) . 

• Player 1 moves to (R, 2) (this corresponds to a move ai = in 7). 

This means that every player 2 move from {R, 2) goes to X^ a x {l a }- Since X* = X±, 

we have (R, 1) G Pre x (Pre x (X\ x {1})) if the first move of player 1 from s } 
corresponds to _L*. 

• Player 1 moves to (R,j,2). This case has two subcases. 

— Suppose we have fact 3 : every player 2 move from (R,j, 2) to (R 1 , l^. um ) is such 
that fl'elf. 

Consider the situation in 7$ where player 1 moves to some (R, j, a\ , 2} with a\ ^ 
_L* and player 2 takes an edge corresponding to it not allowing player 1 in 7. 
Because of fact 3, all such edges must go to X\ x {1}, no matter the action a\. 

— Suppose we have fact 4 : every player 2 move from (R,j, 2) to (R, j, l c ) is such 
that (R,j,l c ) € Pre? (^* a x {la})- This means that in 7, from any state in 
R, if player 1 is allowed to let time elapse to get to the j-th successor from R, 
then it can take a discrete action a\ such that the resultant state will be in some 
region in X\ = X* . Thus, there must exist a player-1 action a\ such that from 

the state (R, j, a±,2) in 7*, all player-2 edges, which correspond to it allowing a 
move in 7, end up in X\ x {1}. 
Because of the two above subcases, if the witness for 

S /*GPref* (Pref* (Pref* x {l a }))) 

in the structure 7?* is a move to (R,j,2), then in the structure 7?, there exists a 
move of player 1 to a state (R,j,ai,2) such that {R,j,a\,2} £ PreJ f (Xi x {1}). 
Thus in both cases of player-1 winning moves in from , we have the existence of 
a player-1 move from which witnesses that = (R, 1} € Pre?' (Pre?' (X± x {1})^ . 

□ 
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5.3. Proofs of Section [4]. Completion of analysis of Example 14.91 
Example \4-9\ (continued). Let the game start from the location 1°. In a run r, let t[ 
and t\ be the times when the a^-th transition and the a^-th transitions respectively are 
taken for the j-th time. The constraints are t\ — 4 1 < 1 and 4 ~ 4 1 > 1. If the game 
cycles infinitely often in between 1° and l l we must also have that for all j > 0, > 
tL > ij. Conversely, we also have that if this condition holds then we can construct an 
infinite time divergent cycle of Z ,/ 1 for some suitable initial clock values. Observe that 
i = $ + (4 - 4) + (i? - 4) + ■ ■ ■ + (4 - t 1 ) for i G {1, 2}. We need 

m 

t ra+l _ t m = _ £ mj + {(4 - - (4 - 4" 1 )} + (4 ~ 4) > for all m > 0. 

Rearranging, we get the requirement 

m 

E{(4 - - (4 - 4' 1 )} < - 1?) + (t? - 4). 

Consider the initial state (Z°,x = y = 0). Let i§ = l,t° = 1.1, - 4 _1 = 1,4 - 4 1 = 
1 + 10-( j+1 \ We have 

m oo 

Thus, we have an infinite time divergent trace with the given values. Hence (Z°,x = 
y = 0) G WinTimeDiv^(n(-i/ 3 )). It can also be similarly seen that {l°,x = y = 1} G 
WinTimeDivi(D(^Z 3 )) (taking = and ^ = 0.1). We also have (l°,x = y = 1) <£ 
RobWinTimeDivi(n(-iZ 3 )) as player 1 would have to take the first a\ transition immediately 
from this state. 

We now show = y = 0} G RobWinTimeDivi(n(-i/ 3 )). Consider 

i$ G [0.9, 1],4 - 4" 1 G [1 — 10- (J+1) , 1] 
4 G [1.05,1.1] 

4 - 4~ l G I 1 + °- 5 * io^ (j+1) , i + 

We have 

E{(*2 - 1 1 ) - (4 - 4' 1 )} < E 10 ~ y+1) - (-io- (j+1) ) 

oo 

< 2 * y io-^' +i ) 
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We also have (i™ +1 - tf) + (t? - 1§) > 1 - l(H m+2 ) + (0.9 - 1.1) > 0.7. Thus, we have 



This shows that we can construct an infinite cycle in between 1° and I for all the values in 
our chosen intervals, and hence that (l°,x = y = 0) € RobWinTimeDivi(n(-i/ 3 )). 

We next show that (/°, x = y = 0} ^ JRWinTimeDiv^ J ' £r (n(-i/ 3 )) for any e- } > 0. Observe 
that for any objective <£, we have JRWinTimeDiv^ J ' £r (<I>) C JRWinTimeDiv^ J '°(<I>). Let ej = e 
and let e r = 0. Consider any player-1 strategy n±, for e-jitter 0-response time bounded- 
robust winning, that makes the game cycle in between 1° and I 1 . Player 2 then has a 
strategy which "jitters" the player-1 moves by e. Thus, the player-1 strategy tt\ can only 
propose a\ moves with the value of x being less than or equal to 1 — e (else the jitter would 
make the move invalid). Thus, player 2 can ensure that t\ — t^" 1 < 1 — e for all j for 
some run (since x has the value t{ — t^ -1 when a\ is taken for the j-th time for j > 0). 
We then have that for any player-1 strategy that is a candidate for e-jitter 0-response time 
bounded-robust winning, player 2 has a strategy such that for some resulting run, we have 



t{-t{ 1 < 1 - e and t{ - t^ 1 > 1. Thus, J2f=i { (4 ~ ~ (*i ~ A 1 ) } > m * e > which 



can be made arbitrarily large for a sufficiently large m for any e and hence greater than 

ym+l _ t m) + _ t 0) < l + (£0 _ for any initial yalues Q f j0 and j0 Thig vio l ates t h e 

requirement for an infinite 1° J 1 cycle. Thus, (l°,x = y = 0} ^ JRWinTimeDiv^°(n(^/ 3 )) 




j'=i 



< 0.7 



<(C + -C) + (il-*2)- 



for any e > 0. 



□ 
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